COMPOSITIONS AND METHODS INVOLVING AN ESSENTIAL
STAPHYLOCOCCUS AUREUS GENE AND ITS ENCODED PROTEIN 

RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Patent Application No. 09/470,512, filed
December 22, 1999. 

FIELD OF THE INVENTION 

The invention relates to bacterial and bacteriophage genes. 

BACKGROUND OF THE INVENTION 

The Staphylococci make up a medically important genus of microbes known to cause
several types of diseases in humans. S. aureus is a Gram-positive organism which can be found
on the skin of healthy human hosts. It is responsible for a large number of bacteremias, where its
portal of entry can be the skin, lungs, urinary tract or infected intravascular devices (Steinberg et
al., (1996) Clin. Infect. Dis. 23: 255-259; Røder et al., (1999) Arch. Intern. Med. 159: 462-469).
It can cause fatal endocarditis or damage to the heart and, due to its exotoxin, can cause
death via "Toxic Shock" (Frimodt-Møller et al., (1997) Clin. Microbiol. Infect. 3: 297-305;
Sanabria et al., (1990) Arch. Intern. Med. 150: 1305-1309).

Only S. aureus and Staphylococcus epidermidis, of the nineteen species of
Staphylococcus described in Bergey's Manual (1992), have significant interactions with humans.
They are among the normal flora of humans, and are found on nasal passages, skin and mucous
membranes. S. aureus, when pathogenic in humans, can cause a number of suppurative
(pus-forming) infections, as well as food poisoning, endocarditis, and toxic shock syndrome.

S. aureus causes superficial skin lesions, such as boils, styes and furunculosis; more
serious infections include pneumonia, mastitis, phlebitis, meningitis, and urinary tract infections,
in addition to osteomyelitis and endocarditis. S. aureus is also a major cause of
hospital-acquired (nosocomial) infection of surgical wounds and infections associated with inserted and
implanted medical devices. Lastly, S. aureus causes food poisoning through the release of
enterotoxins into food, and toxic shock syndrome through the release of superantigens into the
blood stream. S. aureus also secretes two types of toxin with superantigen activity: 1)
enterotoxins, of which there are six antigenic types (named SE-A, B, C, D, E and G) and 2) toxic
shock syndrome toxin (TSST-1).

S. aureus has been successfully treated with the penicillin derivative Methicillin in the
past, but is now becoming increasingly resistant (MRSA - Methicillin Resistant S. aureus) to this
antibiotic (Harbath et al., (1998) Arch. Intern. Med. 158: 182-189). For example, S. aureus
endocarditis mortality can range from 26% to 45%, and combined β-lactam/aminoglycoside therapy
is proving increasingly ineffective in disease eradication (Røder et al., (1999) Arch. Intern. Med.
159: 462-469). However, MRSA infections continue to be sensitive to treatment with
vancomycin, which is the drug of last resort. Infections caused by MRSA have been increasing
in children and adults; isolates have been found in 97% of all large, university-based teaching
hospitals in the United States. Since 1996, three cases of vancomycin resistant S. aureus have
been reported. This new strain represents a particularly dangerous development of an aggressive
bacterial pathogen which does not respond to any known antibiotic. The emergence of resistance
to vancomycin has the potential to result in untreatable (and thus fatal) S. aureus infections.

It is no longer uncommon to isolate S. aureus strains which are resistant to most of the 
standard antibiotics, and thus there is an unmet medical need and demand for new anti-microbial
agents, vaccines, drug screening methods, and diagnostic tests for this organism.

SUMMARY OF THE INVENTION 

The present invention relates to Dnal and Dnal-related proteins, in particular S. aureus 
Dnal polypeptides and dnal polynucleotides, recombinant materials and methods for their
production. The invention also relates to a pair of interacting proteins, a growth-inhibitory (or
inhibitor) bacteriophage 77 ORF 104 gene product that interacts with the S. aureus Dnal
polypeptide, the interacting regions of the S. aureus Dnal related protein and the protein encoded
by the S. aureus bacteriophage 77 ORF 104, forming the basis for screening assays. It also
relates to polynucleotides and polypeptides of a multiprotein complex believed to be involved in
initiation of DNA replication containing Dnal as a subunit, which may also include DnaC and
related proteins, as well as variants of them. In another aspect, the invention relates to methods 
for using such polypeptides and polynucleotides, including treatment of microbial diseases, 
amongst others. In a further aspect, the invention relates to methods for identifying agonists and 
antagonists using the materials provided by the invention, and for treating microbial infections
and conditions associated with such infections with the identified agonist or antagonist
compounds. In a still further aspect, the invention relates to diagnostic assays for detecting 
diseases associated with microbial infections and conditions associated with such infections, 
such as assays for detecting Dnal expression or activity. 

The invention encompasses a method of identifying a compound that is active on a
polypeptide comprising the amino acid sequence of SEQ ID NO: 16. The method comprises
contacting a candidate compound with the polypeptide, and detecting binding of the candidate
compound to the polypeptide, wherein detection of binding is indicative that the compound is
active on the polypeptide.

In one embodiment, the step of detecting comprises the step of measuring the binding of 
a candidate compound, wherein the compound is directly or indirectly detectably labeled, to the 
polypeptide. 

In another embodiment, the step of detecting comprises measurement by phage display.

In another embodiment, the step of detecting comprises measurement by surface plasmon
resonance.

In another embodiment, the step of detecting comprises measurement by FRET.

In another embodiment, the step of detecting comprises measurement of fluorescence
polarization changes.

In another embodiment, the step of detecting comprises a scintillation proximity assay.

In another embodiment, the step of detecting comprises a biosensor assay. 

In another embodiment, the active compound is selected from the group consisting of a 
small molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage 
inhibitor protein. 

In another embodiment, the active compound is a peptide synthesized by a recombinant
expression system and purified, or artificially synthesized.

The invention also encompasses a method of identifying a compound that is active on a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 16, the method comprising the 
steps of contacting a first and a second polypeptide in the presence and absence of a candidate
compound, wherein the first polypeptide comprises the amino acid sequence of SEQ ID NO: 16
or a fragment or variant thereof that specifically binds phage 77ORF104 and the second
polypeptide comprises phage 77 ORF 104 or a domain thereof that specifically binds a
polypeptide of SEQ ID NO: 16, and detecting the binding of the first and second polypeptides to
each other, wherein a decrease in the binding of the first and the second polypeptides in the
presence of the candidate compound relative to the binding in the absence of the candidate
compound identifies the candidate compound as a compound that is active on a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 16. 
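
The comparison recited in this method can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names and the 10% cutoff `min_decrease` are hypothetical, since the text requires only "a decrease" in binding and does not fix a threshold or a particular readout.

```python
def relative_binding(signal_with: float, signal_without: float) -> float:
    """Binding with the candidate compound as a fraction of binding without it."""
    if signal_without <= 0:
        raise ValueError("control (no-compound) signal must be positive")
    return signal_with / signal_without


def reduces_binding(signal_with: float, signal_without: float,
                    min_decrease: float = 0.10) -> bool:
    """True if binding drops by at least min_decrease (a hypothetical cutoff)."""
    return relative_binding(signal_with, signal_without) <= 1.0 - min_decrease


# Illustrative numbers only (arbitrary signal units, not data from the source).
print(reduces_binding(signal_with=40.0, signal_without=100.0))
print(reduces_binding(signal_with=98.0, signal_without=100.0))
```

Any of the detection formats listed in the embodiments could in principle supply the two signals; the sketch is agnostic to how they are measured.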

In one embodiment, the first or the second polypeptide is directly or indirectly detectably
labeled. 

In another embodiment, the step of detecting comprises measurement by phage display.

In another embodiment, the step of detecting comprises measurement by surface plasmon
resonance.

In another embodiment, the step of detecting comprises measurement by FRET.

In another embodiment, the step of detecting comprises measurement of fluorescence
polarization changes.

In another embodiment, the step of detecting comprises a scintillation proximity assay. 

In another embodiment, the step of detecting comprises a biosensor assay. 

The invention further encompasses an agonist or an antagonist of the activity of a Dnal
polypeptide or a gene encoding the polypeptide. 

The invention further encompasses a method of identifying a compound that is active on 
a Dnal polypeptide, comprising the steps of contacting a candidate compound with cells 
expressing a polypeptide comprising SEQ ID NO: 16 and detecting Dnal activity in the cells, 
wherein a decrease in activity relative to Dnal activity in cells not contacted with a candidate 
compound is indicative of inhibition of Dnal activity. 

The invention further encompasses a method of making an antibacterial compound, 
comprising the steps of: a) determining whether a candidate compound is active on a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 16 or a gene encoding the polypeptide; and 
b) synthesizing or purifying the candidate compound in an amount sufficient to provide a
therapeutic effect when administered to an organism infected by a bacterium naturally producing 
a polypeptide comprising the amino acid sequence of SEQ ID NO: 16. 

In one embodiment, the candidate compound is selected from the group consisting of a 
small molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage 
inhibitor protein. 

In one embodiment, the candidate compound is a peptide synthesized by a recombinant 
expression system and purified, or artificially synthesized. 

The invention further encompasses a method for inhibiting a bacterium, comprising 
contacting the bacterium with a compound active on a polypeptide comprising the amino acid
sequence of SEQ ID NO: 16 or a gene encoding the polypeptide. 

In one embodiment, the step of contacting is performed in vitro. 

In another embodiment, the step of contacting is performed in vivo in an animal.

In another embodiment, the compound is selected from the group consisting of a small
molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage inhibitor 
protein. 

In another embodiment, the compound is a peptide synthesized by a recombinant
expression system and purified, or is artificially synthesized. 

The invention further encompasses a method for treating a bacterial infection in an 
animal suffering from an infection, comprising administering to the animal a therapeutically 
effective amount of a compound active on a polypeptide comprising the amino acid sequence of
SEQ ID NO: 16 or a gene encoding the polypeptide. The animal is preferably, but not
necessarily, a mammal, more preferably a human.

In one embodiment, the compound is selected from the group consisting of a small 
molecule, a peptidomimetic compound, and a bacteriophage inhibitor protein.

The invention further encompasses a method of prophylactic treatment to prevent
bacterial infection comprising contacting an indwelling device with a compound active on a
polypeptide comprising the amino acid sequence of SEQ ID NO: 16 before its implantation into
a mammal, such contacting being sufficient to prevent S. aureus infection at the site of
implantation.

The invention further encompasses a method of prophylactic treatment to prevent
infection of an animal by a bacterium comprising administering to the animal a compound that
is active on a polypeptide comprising the amino acid sequence of SEQ ID NO: 16 or a gene 
encoding the polypeptide in an amount sufficient to reduce adhesion of the bacterium to a tissue 
surface of the mammal. 

The invention further encompasses a method of diagnosing in an individual an infection
with Staphylococcus aureus, comprising determining the presence in the individual of a
polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

In one embodiment, the determining step comprises contacting a biological sample of the 
individual with an antibody specific for an epitope present on a polypeptide comprising the
amino acid sequence of SEQ ID NO: 16.

The invention further encompasses a method of diagnosing in an individual an infection
with Staphylococcus aureus, comprising determining the presence in the individual of a nucleic
acid sequence encoding a polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

In one embodiment, the determining step comprises contacting a nucleic acid sample of
said individual with an isolated, purified or enriched nucleic acid probe of at least 15 nucleotides
in length that hybridizes under stringent hybridization conditions with the sequence of SEQ ID 
NO: 1, or the complement of such probe. 

The invention further encompasses an isolated, purified or enriched polynucleotide
comprising a nucleotide sequence that has at least 55% identity to the sequence of SEQ ID NO: 
1, or the complement of said nucleotide sequence. 

The invention further encompasses an isolated, purified or enriched polynucleotide 
consisting of nucleotides 448-942 of SEQ ID NO: 1, herein referred to as SEQ ID NO: 17,
comprising a nucleotide sequence encoding the polypeptide of SEQ ID NO: 16 or the
complement of such nucleotide sequence.

The invention further encompasses an isolated, purified or enriched polynucleotide
consisting of the sequence of SEQ ID NO: 17.

The invention further encompasses an isolated, purified or enriched polypeptide having at
least 55% identity to the amino acid sequence of SEQ ID NO: 16.

The invention further encompasses an isolated, purified or enriched polypeptide of at 
least 50 amino acids in length having at least 50% identity to the amino acid sequence of SEQ
ID NO: 16.

The invention further encompasses an isolated, purified or enriched polypeptide having at
least 70% similarity to the amino acid sequence of SEQ ID NO: 16.

The invention further encompasses an isolated, purified or enriched polypeptide of at
least 20 amino acids in length having at least 60% similarity to the amino acid sequence of SEQ
ID NO: 16. 
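
As an illustration of how a percent-identity threshold of this kind might be checked, the following Python sketch scores two sequences that have already been aligned. It is a minimal sketch under stated assumptions: the function names are hypothetical, the alignment is supplied by the caller, and the counting convention (identical residues divided by aligned, ungapped columns) is one of several in use and is not specified in this passage. The sequences shown are made-up placeholders, not SEQ ID NO: 16.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over ungapped columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue  # skip gap columns (assumed convention)
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold_pct: float = 55.0) -> bool:
    """True if the aligned pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold_pct


# Placeholder sequences for illustration only.
print(round(percent_identity("MKTAY-LV", "MKSAYQLV"), 1))
```

Percent similarity would additionally count conservative substitutions as matches, which requires a substitution scheme that this sketch does not attempt to supply.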

The invention further encompasses an isolated polypeptide comprising the amino acid
sequence of SEQ ID NO: 16.

The invention further encompasses an isolated polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 16. 

The invention further encompasses an isolated, purified or enriched antibody specific for 
a polypeptide comprising SEQ ID NO: 16.

The invention further encompasses a composition comprising two polypeptides, a
bacteriophage 77 ORF 104 polypeptide and a polypeptide comprising the amino acid sequence
of SEQ ID NO: 16 or a variant thereof that specifically binds phage 77 ORF 104 polypeptide.

The invention further encompasses a composition comprising a nucleic acid encoding
bacteriophage 77 ORF 104 and a nucleic acid comprising SEQ ID NO: 17 or a variant thereof
that encodes a polypeptide that specifically binds bacteriophage 77 ORF 104 polypeptide.

Further features and advantages of the invention will become more fully apparent in the
following description of the embodiments and drawings thereof, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 shows the nucleotide (A; SEQ ID NO: 1) and amino acid (B; SEQ ID NO: 2)
sequences of S. aureus Dnal.

Fig. 2 shows the complete nucleotide sequence of the S. aureus bacteriophage 77 genome
(SEQ ID NO: 3).

Fig. 3 shows an ORF map of the S. aureus bacteriophage 77 genome. 

Fig. 4 shows the nucleotide (A; SEQ ID NO: 4) and the amino acid (B; SEQ ID NO: 5) 
sequences of S. aureus bacteriophage 77 ORF 104.

Fig. 5 shows the predicted tryptic peptide masses of the ORF identified in the University
of Oklahoma S. aureus genomic database that closely matches the tryptic peptide profile of the
polypeptide bound by 77ORF104.

Fig. 6 shows alignments of B. subtilis DnaC sequences with the homologous sequences
from S. aureus. A) shows an alignment of the B. subtilis dnaC polynucleotide sequence (SEQ ID
NO: 6) with the homologous S. aureus dnaC polynucleotide sequence (SEQ ID NO: 7) identified
by BLAST searching the S. aureus database at http://www.tigr.org with the B. subtilis dnaC
sequence. B) shows an alignment of the B. subtilis DnaC amino acid sequence (SEQ ID NO: 8) with
the predicted amino acid sequence of the polypeptide (SEQ ID NO: 9) encoded by the S. aureus
dnaC polynucleotide sequence shown in Fig. 6A.

Fig. 7 shows the killing potential of bacteriophage 77 ORF 104 and the expression vector
used to induce its expression in S. aureus: A) Schematic diagram of expression vector pT/ORF
used to induce expression of 77ORF104 in S. aureus cells; B) Results of a screen to assess the
killing potential of 77ORF104 when expressed in S. aureus grown on semi-solid support media;
and C) Results showing the inhibitory potential of 77ORF104 when expressed in S. aureus in
liquid media.

Fig. 8 shows affinity chromatography using GST and GST/ORF104 as ligands with the S.
aureus extract prepared by French pressure cell lysis and sonication. Eluates from affinity
columns containing the GST and GST/ORF104 ligands at 0, 0.1, 0.5, 1.0, and 2.0 mg/ml resin
were resolved by 12.5% SDS-PAGE. Proteins were visualized by silver staining. Micro-columns
were eluted with: A) ACB containing 1 M NaCl; B) 250 mM NaCl; C) 1% Triton X-100; and D)
1% SDS. Each molecular weight marker (Mr) is approximately 100 ng. The lanes labeled ACB
indicate eluates from a 2.0 mg/ml ligand column loaded only with ACB buffer containing 75
mM NaCl. The arrows indicate bands specifically interacting with GST/ORF104.

Fig. 9 shows affinity chromatography with GST and GST/ORF104 as ligands with the S.
aureus extract prepared by lysis with lysostaphin digestion and sonication. Eluates from affinity 
columns containing the GST and GST/ORF104 ligands at 0, 0.1, 0.5, 1.0, and 2.0 mg/ml resin
were resolved by 12.5% SDS-PAGE. Micro-columns were sequentially eluted with 75 mM ACB
containing 1% Triton X-100, 250 mM NaCl, 1 M NaCl ACB, and 1% SDS. The elution profile
obtained with 1% SDS is shown. Each molecular weight marker (Mr) is approximately 100 ng.
The lanes labeled ACB indicate eluates from a 2.0 mg/ml ligand column loaded only with ACB
buffer containing 75 mM NaCl. Lanes labeled C and L are corresponding elutions from columns
containing GST and GST/ORF104 at 2.0 mg/ml from Figure 8. The arrow indicates a 
polypeptide specifically interacting with GST/ORF104. 

Fig. 10 shows affinity chromatography with ORF104 (GST removed) as ligand with the
S. aureus extract prepared by lysis with lysostaphin digestion and sonication (Lys extract) and
French pressure cell and sonication (FP/S extract). Eluates from affinity columns containing the
ORF104 ligand at 0, 0.1, 0.5, 1.0, and 2.0 mg/ml resin were resolved by 12.5% SDS-PAGE and
the gel was stained with silver nitrate. Micro-columns were sequentially eluted with: ACB
containing 1% Triton X-100; 250 mM NaCl; 1 M NaCl; and 1% SDS. The elution profile
obtained with 1% SDS is shown. Each molecular weight marker (Mr) is approximately 100 ng.
The lanes labeled ACB indicate eluates from a 2.0 mg/ml ligand column loaded only with ACB
buffer containing 75 mM NaCl. The arrow indicates a polypeptide specifically interacting with
GST/ORF104. 

Fig. 11 shows results of a tryptic peptide mass spectrum analysis showing relatedness 
between the interacting protein eluted with Triton X-100 (indicated by arrow in Fig. 8C) and the
interacting protein eluted with 1% SDS (indicated by arrow in Fig. 8D). Of note are the tryptic
peptides having monoisotopic MH+ masses of 1351.8, 1412.7, and 1617.8 Da. 

Fig. 12 shows the results of yeast two hybrid analyses designed to test the interaction of 
S. aureus Dnal and 77 ORF 104. A) Construction of the yeast pGADT7 vector expressing the
polypeptide Gal4 activation domain (GAD) fused to the S. aureus Dnal. B) Construction of the
yeast pGBKT7 vector expressing the polypeptide Gal4 DNA binding domain (GBK) fused to
phage 77 ORF104. 77 ORF 104 and Dnal were also cloned into pGADT7 and pGBKT7,
respectively (not shown). C) Yeast two-hybrid assay. D) Yeasts were co-transformed, as
indicated (No 1 to 6), in the presence or in the absence of control vectors. pGADT7-T and
pGBKT7-53 (No 1) are a positive control for protein:protein interaction and pCL1 (No 4) encodes an
active Gal4 transcription factor. Co-transformants were plated in parallel on yeast synthetic
medium (SD) supplemented with amino acid drop-out lacking tryptophan and leucine (TL
minus) and on SD supplemented with amino acid drop-out lacking tryptophan, histidine, adenine 
and leucine (THAL minus). Co-transformants harboring 77 ORF104 polypeptide only grew on 
selective THAL minus media in the presence of Dnal (No 5 and 6). E) Results of the
luminescent β-galactosidase enzymatic assays with protein extracts from the same
co-transformants (No 1 to 6).

Fig. 13 shows inhibition of S. aureus DNA synthesis by bacteriophage 77 ORF 104
protein.

Fig. 14 shows the interaction between partial proteolysis fragments of Dnal and ORF 104
from S. aureus bacteriophage 77. Partial proteolytic fragments generated by A) endoproteinase
Glu-C or B) chymotrypsin were subjected to affinity chromatography using columns containing
either 0 or 2.0 mg/ml of 77ORF104 protein. Partial proteolytic fragments interacting with the
77ORF104 and not the control column were excised for peptide mapping. Lanes are indicated as
Mr, molecular weight markers; L, load; FT, flowthrough; 1, 1 M NaCl elution; 2, 1% SDS
elution; ACB, affinity chromatography buffer. The interacting bands excised for peptide
mapping are indicated according to the apparent Mr by SDS-PAGE; bands not interacting are
indicated with (-). C) List of identified Dnal proteolytic fragments interacting with 77 ORF 104.
Partial proteolytic fragments interacting with 77ORF104 were purified by reverse phase,
analyzed with MALDI-TOF, and the observed high molecular weight fragments mapped to the
corresponding amino acid sequence of SEQ ID NO: 2. The minimal domain of Dnal interacting
with 77 ORF 104 as determined by partial proteolysis with chymotrypsin is amino acids 131 to
313 and with endoproteinase Glu-C is amino acids 119 to 313 of SEQ ID NO: 2.

Fig. 15 shows the amino acid sequence of the Dnal fragments tested in yeast two-hybrid
system for interaction with 77ORF104. SEQ ID NO: 16 contains the amino acids 150 to 313 of
SEQ ID NO: 2 and SEQ ID NO: 17 contains the corresponding nucleotides 448 to 942 of SEQ
ID NO: 1. SEQ ID NO: 18 contains the amino acids 64 to 313 of SEQ ID NO: 2.
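
The residue and nucleotide coordinates given for SEQ ID NOs: 16 and 17 can be cross-checked with simple codon arithmetic. The Python sketch below assumes, and this is not stated in the text, that nucleotide 1 of SEQ ID NO: 1 is the first base of codon 1 of SEQ ID NO: 2 and that residue 313 is followed directly by a three-base stop codon. The function name is hypothetical.

```python
def coding_span(first_aa: int, last_aa: int, include_stop: bool = False):
    """Map a 1-based amino acid range to 1-based nucleotide coordinates.

    Assumes codon 1 starts at nucleotide 1 (an assumption, see the lead-in).
    """
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("invalid amino acid range")
    start_nt = 3 * (first_aa - 1) + 1
    end_nt = 3 * last_aa + (3 if include_stop else 0)
    return start_nt, end_nt


# Amino acids 150 to 313, with and without an assumed trailing stop codon.
print(coding_span(150, 313))
print(coding_span(150, 313, include_stop=True))
```

Under these assumptions the 164 residues from 150 to 313 occupy nucleotides 448 to 939, and including a stop codon extends the span to nucleotide 942, which is consistent with the coordinates stated for SEQ ID NO: 17.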

Fig. 16 shows the results of the yeast two-hybrid analysis that were designed to test the
interaction between fragments of Dnal and 77 ORF 104. Fragments of S. aureus Dnal were
cloned into pGADT7 vector. Yeasts were co-transformed with the plasmids indicated from No 1
to 6. pGBKLam and 77pGADORF13 are control vectors driving the expression of
non-interacting proteins. Co-transformants were plated in parallel on THAL minus SD medium and
on TL minus SD medium. Co-transformants bearing 77ORF104 only grew on selective THAL
minus media in the presence of Dnal or Dnal fragments (No 1, 3 and 5). D) Representation of
fragments of Dnal interacting with 77ORF104. The minimal domain of Dnal interacting with 77
ORF 104 as determined by yeast two-hybrid analysis is amino acids 150 to 313.

DESCRIPTION OF THE INVENTION

The invention is based on the discovery of an essential gene and its encoded polypeptide
in S. aureus and portions thereof useful in screening, diagnostics, and therapeutics. The
invention also relates to S. aureus Dnal polypeptides and polynucleotides as described in greater
detail below, and to a pair of polynucleotides encoding a pair of interacting polypeptides, and the
pair of polypeptides themselves, or interacting domains thereof, where the pair includes a S.
aureus Dnal polypeptide and a 77 ORF 104 polypeptide. Also, the invention relates to
polynucleotides and polypeptides of a protein complex, thought to be involved in initiation of
DNA replication, containing Dnal and DnaC related proteins, as well as their variants. In
particular, the invention relates to polypeptides and polynucleotides of a Dnal of S. aureus,
which is related by amino acid sequence homology to B. subtilis Dnal polypeptide. The
invention relates especially to Dnal having the nucleotide and amino acid sequences disclosed as
SEQ ID NO: 1 and SEQ ID NO: 2, respectively. The sequences presented as SEQ ID NOs: 1 and
2 represent an exemplification of the invention, since those of ordinary skill will recognize that
such sequences can be usefully employed in polynucleotides in general, including
ribopolynucleotides.

We have used the methodology of two previous inventions (U.S. Patent Application
Serial No. 09/407,804, filed Sept. 28, 1999, and U.S. Provisional Patent Application 60/110,992
filed December 3, 1998) to identify and characterize an essential polynucleotide and polypeptide
sequence from S. aureus. Thus, the present invention provides polynucleotide and polypeptide
sequences isolated from S. aureus that can be used in a drug screening assay to identify
compounds with anti-microbial activity. The polynucleotide and polypeptide sequences can be
isolated using a method similar to those described herein, or using another method. In addition,
such polynucleotide and polypeptide sequences can be chemically synthesized. 

Definitions

The phrase "active on", with reference to a particular cellular target, such as the product 
of a particular gene, means that the target is an important part of a cellular pathway which 
includes that target and that an agent or compound acts on that pathway. Thus, in some cases the 
agent or compound may act on a component upstream or downstream of the stated target, 
including a regulator of that pathway or a component of that pathway. In general, an
antibacterial agent is active on an essential cellular function, often on a product of an essential
gene. 

As used herein, the terms "inhibit", "inhibition", "inhibitory", and "inhibitor" all refer to a
function of reducing a biological activity or function. Such reduction in activity or function can,
for example, be in connection with a cellular component (e.g., an enzyme), or in connection with
a cellular process (e.g., synthesis of a particular protein), or in connection with an overall process
of a cell (e.g., cell growth). In reference to cell growth, the inhibitory effects may be
bacteriocidal (killing of bacterial cells) or bacteriostatic (i.e., stopping or at least slowing
bacterial cell growth). The latter slows or prevents cell growth such that fewer cells of the strain
are produced relative to uninhibited cells over a given time period. From a molecular standpoint,
such inhibition may equate with a reduction in the level of, or elimination of, the transcription
and/or translation of a specific bacterial target(s), or reduction or elimination of activity of a
particular target biomolecule. 

As used herein, the term "Dnal polypeptide" refers to a polypeptide encompassing S.
aureus Dnal (SEQ ID NO: 2) or an active domain of S. aureus Dnal. As used herein, the term
"active domain of S. aureus Dnal" is a polypeptide fragment or portion of S. aureus Dnal that
retains an activity of S. aureus Dnal. The term "Dnal polypeptide" is meant to encompass S.
aureus Dnal or an active domain of S. aureus Dnal that is fused to another, non-Dnal
polypeptide sequence. 

"Dnal activity" is defined as one or more of the following: 

A) The activity of a polypeptide having the S. aureus Dnal sequence provided herein, a
fragment or analog thereof or a protein comprising a S. aureus Dnal polypeptide that directly
interacts with bacteriophage 77 ORF 104 protein or a Dnal-binding fragment thereof in a manner
that results in at least a 10-fold reduction of ³H-thymidine incorporation in a bacterial DNA
replication assay relative to ³H-thymidine incorporation in an assay lacking bacteriophage 77
ORF 104 or a Dnal-binding fragment thereof.

To assay for Dnal activity by ³H-thymidine incorporation, the level of radiolabeled
thymidine incorporation into DNA is measured in S. aureus cells expressing an
arsenite-inducible 77 ORF104 construct in the presence or absence of 5 µM sodium arsenite. Samples
(0.5 ml) are withdrawn from cultures at appropriate time intervals and mixed with 4.5 µl of
labeling solution (0.2 µCi/ml of ³H-thymidine (73 Ci/mmol, NEN Life Science Products, Inc.)
and 70 pmol of unlabeled thymidine). After 15 minutes of reaction, incorporation is stopped by
adding 5 µl of 0.2% NaN3 and 5 µl of 30 µg/ml unlabeled thymidine. Samples are precipitated
with 10% (w/v) trichloroacetic acid and filtered through glass fiber filters (GF-C, Whatman).
The results are expressed as ³H-thymidine counts incorporated, normalized to the OD of the
culture. 
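
The readout of this assay (counts normalized to culture OD, compared with and without induction of 77 ORF 104) can be sketched in Python as follows. This is a minimal sketch, not the authors' analysis: the function names are hypothetical, the 10-fold threshold is taken from definition A) above, and the numbers are invented for illustration.

```python
def normalized_counts(counts: float, od: float) -> float:
    """Incorporated thymidine counts normalized to the OD of the culture."""
    if od <= 0:
        raise ValueError("OD must be positive")
    return counts / od


def fold_reduction(counts_uninduced: float, od_uninduced: float,
                   counts_induced: float, od_induced: float) -> float:
    """Fold reduction of normalized incorporation upon induction of 77 ORF 104."""
    induced = normalized_counts(counts_induced, od_induced)
    if induced <= 0:
        raise ValueError("induced incorporation must be positive to form a ratio")
    return normalized_counts(counts_uninduced, od_uninduced) / induced


def meets_criterion_a(fold: float, threshold: float = 10.0) -> bool:
    """True if the reduction is at least 10-fold, as in definition A)."""
    return fold >= threshold


# Invented example values (counts and OD), for illustration only.
fold = fold_reduction(12000.0, 0.6, 900.0, 0.5)
print(round(fold, 2), meets_criterion_a(fold))
```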

B) The activity of a polypeptide having the S. aureus Dnal sequence provided herein, or
a fragment or analog thereof, or a protein comprising a S. aureus Dnal polypeptide that is
necessary for at least a 10% inhibition of plasmid replication by bacteriophage 77 ORF 104
protein in the plasmid DNA replication assay. This assay is as follows. The plasmid pC194
replicates in S. aureus by a rolling circle mechanism. The single-stranded origin, sso, of pC194
is involved in the synthesis of the lagging strand of DNA. The plasmid pADG6406 is a
derivative of pC194 lacking sso. The absence of sso leads to the accumulation of single-stranded
plasmid DNA. The single stranded initiation site, ssiA, is located on the lagging strand of
pAMβ1, and is a site for replicative primosome assembly. ssiA was inserted into plasmid pADG6406.
S. aureus cells carrying plasmids are grown to mid-log phase and their total DNA is extracted
and analyzed by Southern hybridization using ³²P-labeled plasmid DNA as probe. The presence
of pADG6406 with ssiA is associated with a decrease in the ratio of single-stranded to
double-stranded plasmid DNA compared to the ratio in cells bearing the same plasmid lacking the ssiA
insert. This system is used to measure the effect of 77 ORF 104 expression on single-stranded
DNA synthesis. A plasmid containing 77 ORF 104 under an arsenite-inducible promoter is
transformed into S. aureus harboring pADG6406. The ratio of single-stranded to double- 
stranded DNA of pADG6406 is measured in the presence and absence of sodium arsenite. An 
increase in the ratio of single-stranded to double-stranded DNA of 10% or more in the presence 
of 77 ORF 104 indicates an effect on Dnal activity. 
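
The comparison at the end of this assay can be sketched in Python as follows. This is only an illustration: the function names are hypothetical, the band signals would come from quantifying the Southern hybridization, and the numbers are invented. The 10% threshold is the one stated in the text.

```python
def ss_to_ds_ratio(ss_signal: float, ds_signal: float) -> float:
    """Ratio of single-stranded to double-stranded plasmid DNA signal."""
    if ds_signal <= 0:
        raise ValueError("double-stranded signal must be positive")
    return ss_signal / ds_signal


def percent_increase(ratio_with_arsenite: float, ratio_without_arsenite: float) -> float:
    """Percent increase of the ss/ds ratio upon induction of 77 ORF 104."""
    if ratio_without_arsenite <= 0:
        raise ValueError("uninduced ratio must be positive")
    return 100.0 * (ratio_with_arsenite - ratio_without_arsenite) / ratio_without_arsenite


def indicates_effect(ratio_with_arsenite: float, ratio_without_arsenite: float,
                     threshold_pct: float = 10.0) -> bool:
    """True if the ss/ds ratio rises by 10% or more, as stated in the text."""
    return percent_increase(ratio_with_arsenite, ratio_without_arsenite) >= threshold_pct


# Invented band intensities (arbitrary units), for illustration only.
uninduced = ss_to_ds_ratio(20.0, 100.0)
induced = ss_to_ds_ratio(30.0, 100.0)
print(round(percent_increase(induced, uninduced), 1), indicates_effect(induced, uninduced))
```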

C) The activity of a polypeptide having the S. aureus sequence provided herein, a
fragment or analog thereof, or a protein comprising a S. aureus Dnal polypeptide in the loading
of S. aureus DnaC helicase onto replicative primosomes. The following helicase assay can be
adapted from an in vitro assay with SPP1 phage G38P (DnaA), G39P (Dnal) and G40P (DnaC)
polypeptides (Ayora et al., 1999, J. Mol. Biol. 288: 71-85). Helicases are capable of unwinding
DNA with a 5' to 3' unwinding polarity. To determine the role of S. aureus Dnal on the helicase
unwinding activity, an annealed substrate with a 3' single-stranded (ss) DNA tail (preformed
fork) is incubated with a constant quantity of purified DnaC helicase and increasing amounts of
either purified Dnal, DnaA or preformed DnaA-Dnal complex. The reaction mixture is subjected
to conditions that support helicase activity. The reaction contains 50 mM NaCl, 1 mM ATP, 50
µg/ml BSA and 0.24 nM ³²P-labeled oligomer annealed to M13 ssDNA offered as substrate. The
DNA molecule in the reaction mixture is analyzed for whether it is converted to single-stranded
(ss) DNA. The reaction is stopped by the addition of 5 µl of stopping solution (100 mM EDTA,
2% (w/v) SDS in DNA loading buffer (Sambrook 1989)) and subsequently loaded onto a 10% 
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non-denaturing PAGE gel The gel is run and dried prior to autoradiography. The ratio of the 
oUgo released from the M13 ssDNA is evaluated. 
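
By way of illustration only, the evaluation of released oligo can be sketched as a fraction
computed per gel lane. The sketch assumes that the autoradiograph has been quantified into a
signal for the released (free) oligo and a signal for the oligo still annealed to M13 ssDNA;
the function name, the lane labels and the numbers are hypothetical.

```python
def fraction_released(released_signal: float, annealed_signal: float) -> float:
    """Fraction of labeled oligo displaced from the M13 ssDNA in one lane."""
    if released_signal < 0 or annealed_signal < 0:
        raise ValueError("signals must be non-negative")
    total = released_signal + annealed_signal
    if total == 0:
        raise ValueError("lane contains no labeled oligo signal")
    return released_signal / total


# Hypothetical titration: constant DnaC helicase, increasing amounts of DnaI.
lanes = {
    "no DnaI": (5.0, 95.0),
    "low DnaI": (25.0, 75.0),
    "high DnaI": (60.0, 40.0),
}
for label, (released, annealed) in lanes.items():
    print(label, round(fraction_released(released, annealed), 2))
```

Expressing each lane as a fraction of its own total signal makes lanes with different
loading comparable.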

D) The binding or interaction of a polypeptide comprising the amino acid sequence of
SEQ ID NO: 16, provided herein, to bacteriophage 77 ORF 104 protein or a portion thereof
capable of binding a polypeptide comprising the amino acid sequence of SEQ ID NO: 16. The
interaction or binding of a polypeptide comprising the amino acid sequence of SEQ ID NO: 16
and a binding portion of bacteriophage 77 ORF 104 may be between isolated polypeptides
consisting essentially of the sequence necessary for binding, or, alternatively, the respective
polypeptide sequence may be comprised within a larger polypeptide. A number of methods,
useful in the invention, to measure the binding of bacteriophage 77 ORF 104 to a polypeptide
comprising the amino acid sequence of SEQ ID NO: 16 are described below. For example,
phage display is a powerful quantitative assay to measure protein:protein interaction using
colorimetric ELISA (enzyme-linked immunosorbent assay). Surface plasmon resonance assays
can be used as a quantitative method to measure binding between two molecules by the change
in mass near an immobilized sensor caused by the binding of one protein from the aqueous phase
to a second immobilized on the sensor. An additional useful binding assay is Fluorescence
Resonance Energy Transfer (FRET), in which the close proximity of two fluorophores, each
bound to a separate molecule and chosen so that the emission spectrum of one overlaps with
the excitation spectrum of the second, permits energy transfer between them, and thus dual
fluorescence following excitation of only one fluorophore is indicative of binding. An
additional assay useful in the present invention is fluorescence polarization, in which the
quantifiable polarization value for a given fluorescently-tagged molecule is altered upon
binding to a second molecule. A scintillation proximity assay can also be used to measure
binding of a polypeptide comprising the amino acid sequence of SEQ ID NO: 16 and
bacteriophage 77ORF104, in which the emittance of radioactive particles is altered upon
binding. Additionally, binding can be evaluated by a Bio Sensor assay, which is based on the
ability of the sensor to register changes in admittance induced by ion-channel modulation
following binding. A further assay which can be used to measure the binding of a polypeptide
comprising the amino acid sequence of SEQ ID NO: 16 and bacteriophage 77 ORF 104 is the
yeast two hybrid assay, in which the binding of the two polypeptides, within the context of two
fusion proteins expressed in yeast cells, permits the expression of reporter molecules which,
in turn, produce a measurable or detectable signal.

The activity of the dnaI gene is defined as the expression of an RNA encoding a S. aureus
DnaI polypeptide according to the invention.

As used herein, the term "polynucleotide encoding a polypeptide" or equivalent language
encompasses polynucleotides that include a sequence encoding a polypeptide of the invention,
particularly a bacterial polypeptide and more particularly a polypeptide of S. aureus DnaI protein
having an amino acid sequence set out in Fig. 1, SEQ ID NO: 2. The term also encompasses
polynucleotides that include a single continuous region or discontinuous regions encoding the
polypeptide (for example, polynucleotides interrupted by integrated phage, an integrated
insertion sequence, an integrated vector sequence, an integrated transposon sequence, or due to
RNA editing or genomic DNA reorganization) together with additional regions that also may
contain coding and/or non-coding sequences.

As used herein, the term "dnaI gene" is meant to encompass a polynucleotide encoding a
S. aureus DnaI polypeptide. Any additional nucleotide sequences necessary to direct
transcription of RNA encoding a S. aureus DnaI polypeptide, either in a cell or in vitro, will be
termed "regulatory sequences", which include but are not limited to transcriptional promoters
and enhancers, and transcription terminators.

As used herein, the term "ORF 104" or "phage 77 ORF 104" or "77ORF104"
encompasses a polynucleotide having the sequence provided in Fig. 4 (SEQ ID NO: 4), which
encodes a gene product known as the 77 ORF 104 gene product.

As used herein, the term "polynucleotide(s)" generally refers to any polyribonucleotide or
poly-deoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA.
"Polynucleotide(s)" include, without limitation, single- and double-stranded DNA, DNA that is a
mixture of single- and double-stranded regions or single-, double- and triple-stranded regions,
single- and double-stranded RNA, and RNA that is a mixture of single- and double-stranded
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more
typically, double-stranded, or triple-stranded regions, or a mixture of single- and double-stranded
regions. In addition, "polynucleotide" as used herein refers to triple-stranded regions comprising
RNA or DNA or both RNA and DNA. The strands in such regions may be from the same
molecule or from different molecules. The regions may include all of one or more of the
molecules, but more typically involve only a region of some of the molecules. One of the
molecules of a triple-helical region often is an oligonucleotide. As used herein, the term
"polynucleotide(s)" also includes DNAs or RNAs as described above that contain one or more
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons
are "polynucleotide(s)" as that term is intended herein. Moreover, DNAs or RNAs comprising
unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two
examples, are polynucleotides as the term is used herein. It will be appreciated that a great



WO 01/46383    PCT/US00/35180



variety of modifications have been made to DNA and RNA that serve many useful purposes
known to those of skill in the art. The term "polynucleotide(s)" as it is employed herein
embraces such chemically, enzymatically or metabolically modified forms of polynucleotides, as
well as the chemical forms of DNA and RNA characteristic of viruses and cells, including, for
example, simple and complex cells. "Polynucleotide(s)" also embraces short polynucleotides
often referred to as oligonucleotide(s).

As used herein, the term "polypeptide(s)" refers to any peptide or protein comprising two
or more amino acids joined to each other by peptide bonds or modified peptide bonds.
"Polypeptide(s)" refers to both short chains, commonly referred to as peptides, oligopeptides and
oligomers, and to longer chains generally referred to as proteins. Polypeptides may contain amino
acids other than the 20 gene-encoded amino acids. "Polypeptide(s)" include those modified
either by natural processes, such as processing and other post-translational modifications, or
by chemical modification techniques. Such modifications are well described in basic texts
and in more detailed monographs, as well as in a voluminous research literature, and they are
well known to those of skill in the art. It will be appreciated that the same type of modification
may be present in the same or varying degree at several sites in a given polypeptide. Also, a
given polypeptide may contain many types of modifications. Modifications can occur anywhere
in a polypeptide, including the peptide backbone, the amino acid side-chains, and the amino or
carboxyl termini. Modifications include, for example, acetylation, acylation, ADP-ribosylation,
amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid
derivative, covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond
formation, demethylation, formation of cysteine, formation of pyroglutamate, formylation,
gamma-carboxylation, GPI anchor formation, hydroxylation, iodination, methylation,
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization,
glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues,
hydroxylation, selenoylation, sulfation and transfer-RNA mediated addition of amino acids to
proteins, such as arginylation, and ubiquitination. See, for instance: PROTEINS - STRUCTURE
AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company,
New York (1993); Wold, F., Posttranslational Protein Modifications: Perspectives and Prospects,
pgs. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C.
Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. Enzymol. 182: 626-646
(1990); and Rattan et al., Protein Synthesis: Posttranslational Modifications and Aging, Ann.
N.Y. Acad. Sci. 663: 48-62 (1992). Polypeptides may be branched or cyclic, with or without
branching. Cyclic, branched and branched circular polypeptides may result from
post-translational natural processes and may be made by entirely synthetic methods, as well.

As used herein, the term "variant(s)" refers to a polynucleotide or polypeptide that differs
from a reference polynucleotide or polypeptide, respectively, but retains one or more of the
biological activities of DnaI as described herein. A typical variant of a polynucleotide differs in
nucleotide sequence from another, reference polynucleotide. Changes in the nucleotide sequence
of the variant may or may not alter the amino acid sequence of a polypeptide encoded by the
reference polynucleotide. Nucleotide changes may result in amino acid substitutions, additions,
deletions, and truncations in the polypeptide encoded by the reference sequence, or in the
formation of fusion proteins, as discussed below. A typical variant of a polypeptide differs in
amino acid sequence from another, reference polypeptide. Generally, differences are limited so
that the sequences of the reference polypeptide and the variant are closely similar overall and, in
many regions, identical. A variant and reference polypeptide may differ in amino acid sequence
by one or more substitutions, additions, deletions in any combination. A substituted or inserted
amino acid residue may or may not be one encoded by the genetic code. The present invention
also includes variants of each of the polypeptides of the invention, that is, polypeptides that vary
from the referents by conservative amino acid substitutions whereby a residue is substituted by
another with like characteristics. Typically, such substitutions are among Val, Leu and Ile;
among Ser and Thr; among the acidic residues Asp and Glu; and among the basic residues Lys
and Arg; or aromatic residues Phe and Tyr. Particularly preferred are variants in which 1-10, 1-5,
1-3, 2-3, or 1 amino acid or amino acids are substituted, deleted, or added in any combination. A
variant of a polynucleotide or polypeptide may be a naturally occurring variant, such as an
allelic variant, or it may be a variant that is not known to occur naturally. Non-naturally
occurring variants of polynucleotides and polypeptides may be made by mutagenesis techniques,
by direct synthesis, and by other recombinant methods known to skilled artisans.

As used herein, the term "fragment", when used in reference to a polypeptide, is a variant
polypeptide having an amino acid sequence that is entirely the same as part but not all of the
amino acid sequence of DnaI polypeptide according to the invention. As with S. aureus DnaI
polypeptides, fragments may be "free-standing" ("consisting of"), or comprised within a larger
polypeptide of which they form a part or region, most preferably as a single continuous region in
a single larger polypeptide.

The term "isolated", when used in reference to a nucleic acid, means that a naturally
occurring sequence has been removed from its normal cellular (e.g., chromosomal) environment
or is synthesized in a non-natural environment (e.g., artificially synthesized). Thus, the sequence
may be in a cell-free solution or placed in a different cellular environment. The term does not
imply that the sequence is the only nucleotide chain present, but that it is essentially free (about
90-95% pure at least) of non-nucleotide material naturally associated with it, and thus is
distinguished from isolated chromosomes.
The term "enriched", when used in reference to a polynucleotide, means that the specific
DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of the total DNA or
RNA present in the cells or solution of interest than in normal or diseased cells or in cells from
which the sequence was originally taken. This could be caused by a person by preferential
reduction in the amount of other DNA or RNA present, or by a preferential increase in the
amount of the specific DNA or RNA sequence, or by a combination of the two. However, it
should be noted that enriched does not imply that there are no other DNA or RNA sequences
present, just that the relative amount of the sequence of interest has been significantly increased.

As used herein, the term "significantly higher fraction" indicates that the level of
enrichment is useful to the person making such an enrichment and indicates an increase in
enrichment relative to other nucleic acids of at least about 2-fold, or 5- to 10-fold or even more.
The term also does not imply that there is no DNA or RNA from other sources. The other source
DNA may, for example, comprise DNA from a yeast or bacterial genome, or a cloning vector
such as pUC19. This term distinguishes from naturally occurring events, such as viral infection,
or tumor type growths, in which the level of one mRNA may be naturally increased relative to
other species of mRNA. That is, the term is meant to cover only those situations in which a
person has intervened to elevate the proportion of the desired nucleic acid.

As used herein, the term "purified" in reference to nucleic acid does not require absolute
purity (such as a homogeneous preparation). Instead, it represents an indication that the
sequence is relatively more pure than in the natural environment (compared to the natural level,
this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual clones isolated
from a genomic or cDNA library may be purified to electrophoretic homogeneity. The claimed
DNA molecules obtained from these clones could be obtained directly from total DNA or from
total RNA. cDNA clones are not naturally occurring, but rather are preferably obtained via
manipulation of a partially purified naturally occurring substance (messenger RNA). The
construction of a cDNA library from mRNA involves the creation of a synthetic substance
(cDNA) and pure individual cDNA clones can be isolated from the synthetic library by clonal
selection of the cells carrying the cDNA library. Thus, the process which includes the
construction of a cDNA library from mRNA and isolation of distinct cDNA clones yields an
approximately 10^6-fold purification of the native message over its proportion in naturally
occurring cells. Thus, purification of at least one order of magnitude, preferably two or three
orders, and more preferably four or five orders of magnitude is expressly contemplated. A
genomic library can be used in the same way and yields the same approximate levels of
purification.
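
By way of illustration only, the orders of magnitude of purification contemplated above can
be mapped onto a fold-purification value as follows. The function name and the wording of
the returned labels are hypothetical; the boundaries (10-fold, 100-fold and 10,000-fold) are
simply the lower ends of the one, two-or-three, and four-or-five order ranges.

```python
def purification_tier(fold_purification: float) -> str:
    """Classify a fold-purification value by the orders of magnitude it reaches."""
    if fold_purification <= 0:
        raise ValueError("fold purification must be positive")
    if fold_purification >= 1e4:
        return "four or more orders of magnitude"
    if fold_purification >= 1e2:
        return "two or three orders of magnitude"
    if fold_purification >= 10:
        return "at least one order of magnitude"
    return "below one order of magnitude"


for fold in (5, 10, 500, 1e6):
    print(fold, "->", purification_tier(fold))
```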

The terms "isolated", "enriched", and "purified" used with respect to nucleic acids,
above, may similarly be used to denote the relative purity and abundance of polypeptides. These,
too, may be stored in, grown in, screened in, and selected from libraries using biochemical
techniques familiar in the art. Such polypeptides may be natural, synthetic or chimeric and may
be extracted using any of a variety of methods, such as antibody immunoprecipitation, other
"tagging" techniques, conventional chromatography and/or electrophoretic methods. Some of the
above utilize the corresponding nucleic acid sequence.

As used herein, the term "complement" when used in reference to a given polynucleotide
sequence refers to a sequence of nucleotides which can form a double-stranded heteroduplex in
which every nucleotide in the sequence of nucleotides is base-paired by hydrogen bonding to a
nucleotide opposite it in the heteroduplex with the given polynucleotide sequence. The term
may refer to a DNA or an RNA sequence that is the complement of another RNA or DNA
sequence. As used herein, the term "hybridizes" refers to the formation of a hydrogen-bonded
heteroduplex between two nucleic acid molecules. Generally, a given nucleic acid molecule will
hybridize with its complement, or with a molecule that is sufficiently complementary to the
given molecule to permit formation of a hydrogen-bonded heteroduplex between the two
molecules.
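
By way of illustration only, the notion of a complement in which every nucleotide is
base-paired can be sketched for DNA sequences written 5' to 3'. The function names are
hypothetical, only the four standard DNA bases are handled, and the check is purely
sequence-based; it says nothing about hybridization conditions.

```python
# Watson-Crick pairing for the four standard DNA bases (upper and lower case).
_DNA_PAIRS = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_DNA_PAIRS)[::-1]


def is_full_complement(seq_a: str, seq_b: str) -> bool:
    """True when every nucleotide of seq_a pairs with the opposite one in seq_b."""
    return len(seq_a) == len(seq_b) and reverse_complement(seq_a).upper() == seq_b.upper()


print(reverse_complement("ATGC"))
print(is_full_complement("ATGC", "GCAT"))
```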

As used herein, the term "probe" refers to a polynucleotide of at least 15 nucleotides (nt),
20 nt, 30 nt, 40 nt, 50 nt, 75 nt, 100 nt, 200 nt, 500 nt, 1000 nt, and even up to 5000 to 10,000 nt
in length.

"Identity" and "similarity," as used herein and as known in the art, are relationships
between two or more polypeptide sequences or two or more polynucleotide sequences, as the
case may be, as determined by comparing the sequences.

Amino acid or nucleotide sequence "identity" and "similarity" are determined from an
optimal global alignment between the two sequences being compared. An optimal global
alignment is achieved using, for example, the Needleman-Wunsch algorithm (Needleman and
Wunsch, 1970, J. Mol. Biol. 48: 443-453). "Identity" means that an amino acid or nucleotide at a
particular position in a first polypeptide or polynucleotide is identical to a corresponding amino
acid or nucleotide in a second polypeptide or polynucleotide that is in an optimal global
alignment with the first polypeptide or polynucleotide. In contrast to identity, "similarity"
encompasses amino acids that are conservative substitutions. A "conservative" substitution is
any substitution that has a positive score in the blosum62 substitution matrix (Henikoff and
Henikoff, 1992, Proc. Natl. Acad. Sci. USA 89: 10915-10919). By the statement "sequence A is
n% similar to sequence B" is meant that n% of the positions of an optimal global alignment
between sequences A and B consist of conservative substitutions. By the statement "sequence
A is n% identical to sequence B" is meant that n% of the positions of an optimal global
alignment between sequences A and B consist of identical residues or nucleotides. Optimal
global alignments in this disclosure used the following parameters in the Needleman-Wunsch
alignment algorithm:
For polypeptides:
    Substitution matrix: blosum62.
    Gap scoring function: -A - B*LG, where A=11 (the gap penalty), B=1 (the gap
    length penalty) and LG is the length of the gap.
For nucleotide sequences:
    Substitution matrix: 10 for matches, 0 for mismatches.
    Gap scoring function: -A - B*LG, where A=50 (the gap penalty), B=3 (the gap
    length penalty) and LG is the length of the gap.
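
By way of illustration only, the gap scoring function -A - B*LG can be evaluated as
follows. The sketch reads the polypeptide parameters as A=11 and B=1 and the nucleotide
parameters as A=50 and B=3; the function and constant names are hypothetical, and the
sketch scores a single gap rather than performing the alignment itself.

```python
def gap_score(gap_length: int, gap_penalty: int, length_penalty: int) -> int:
    """Score of one gap of length LG: -A - B*LG."""
    if gap_length < 1:
        raise ValueError("a gap has length 1 or more")
    return -gap_penalty - length_penalty * gap_length


# Parameter sets as read from the text (names are illustrative).
POLYPEPTIDE_GAP = {"gap_penalty": 11, "length_penalty": 1}
NUCLEOTIDE_GAP = {"gap_penalty": 50, "length_penalty": 3}

for length in (1, 3, 10):
    print(length,
          gap_score(length, **POLYPEPTIDE_GAP),
          gap_score(length, **NUCLEOTIDE_GAP))
```

Because the penalty A is charged once per gap and B once per gapped position, a single long
gap scores better than several short gaps of the same total length.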

Typical conservative substitutions are among Met, Val, Leu and Ile; among Ser and Thr;
among the residues Asp, Glu and Asn; among the residues Gln, Lys and Arg; or aromatic
residues Phe and Tyr. In calculating the degree (most often as a percentage) of similarity
between two polypeptide sequences, one considers the number of positions at which identity or
similarity is observed between corresponding amino acid residues in the two polypeptide
sequences in relation to the entire lengths of the two molecules being compared.
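
By way of illustration only, percent identity and percent similarity can be computed from
two polypeptide sequences that have already been globally aligned. The sketch makes several
assumptions that are not taken from the text: gaps are written as "-", the typical
conservative groups listed above stand in for the positive-scoring entries of the blosum62
matrix, identical positions are counted as similar, and the denominator is the number of
alignment columns. The function and constant names are hypothetical.

```python
# Typical conservative groups from the text, in one-letter code (a stand-in for blosum62).
CONSERVATIVE_GROUPS = [set("MVLI"), set("ST"), set("DEN"), set("QKR"), set("FY")]


def _is_conservative(a: str, b: str) -> bool:
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def percent_identity_similarity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (percent identity, percent similarity) over all alignment columns."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must be non-empty and of equal length")
    identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # a gapped column is neither identical nor similar
        if a == b:
            identical += 1
            similar += 1
        elif _is_conservative(a, b):
            similar += 1
    columns = len(aligned_a)
    return 100.0 * identical / columns, 100.0 * similar / columns


print(percent_identity_similarity("MKV-S", "MRLAT"))
```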

As used herein, the term "antibody" is meant to encompass constructions using the
binding (variable) region of such an antibody, and other antibody modifications. Thus, an
antibody useful in the invention may comprise a whole antibody, an antibody fragment, a
polyfunctional antibody aggregate, or in general a substance comprising one or more specific
binding sites from an antibody. The antibody fragment may be a fragment such as an Fv, Fab or
F(ab')2 fragment or a derivative thereof, such as a single chain Fv fragment. The antibody or
antibody fragment may be non-recombinant, recombinant or humanized. The antibody may be
of an immunoglobulin isotype, e.g., IgG, IgM, and so forth. In addition, an aggregate, polymer,
derivative and conjugate of an immunoglobulin or a fragment thereof can be used where
appropriate. Neutralizing antibodies are especially useful according to the invention for
diagnostics, therapeutics and methods of drug screening and drug design.

As used herein, the term "specific for an epitope present on a S. aureus DnaI
polypeptide", when used in reference to an antibody, means that the antibody recognizes and
binds an antigenic determinant present on a S. aureus DnaI polypeptide according to the
invention.

As used herein, the term "antigenically equivalent derivative(s)" encompasses a
polypeptide, polynucleotide, or the equivalent of either which will be specifically recognized by
certain antibodies which, when raised to the protein, polypeptide or polynucleotide according to
the invention, interfere with the immediate physical interaction between pathogen and
mammalian host.

As used herein, the term "essential", when used in connection with a gene or gene
product, means that the host cannot survive, or is significantly growth compromised, in
the absence or depletion of functional product. An "essential gene" is thus one that encodes a
product that is beneficial, or preferably necessary, for cellular growth in vitro in a medium
appropriate for growth of a strain having a wild-type allele corresponding to the particular gene
in question. Therefore, if an essential gene is inactivated or inhibited, that cell will grow
significantly more slowly than a wild-type strain or even not at all. Preferably, growth of a strain
in which such a gene has been inactivated will be less than 20%, more preferably less than 10%,
most preferably less than 5% of the growth rate of the wild-type, or the rate will be zero, in the
growth medium. Preferably, in the absence of activity provided by a product of the gene, the cell
will not grow at all or will be non-viable, at least under culture conditions similar to normal in
vivo growth conditions. For example, absence of the biological activity of certain enzymes
involved in bacterial cell wall synthesis can result in the lysis of cells under normal osmotic
conditions, even though protoplasts can be maintained under controlled osmotic conditions.
Preferably, but not necessarily, if such a gene is inhibited, e.g., with an antibacterial agent or a
phage product, the growth rate of the inhibited bacteria will be less than 50%, more preferably
less than 30%, still more preferably less than 20%, and most preferably less than 10% of the
growth rate of the uninhibited bacteria. As recognized by those skilled in the art, the degree of
growth inhibition will generally depend upon the concentration of the inhibitory agent. In the
context of the invention, essential genes are generally the preferred targets of antimicrobial
agents. Essential genes can encode "target" molecules directly or can encode a product involved
in the production, modification, or maintenance of a target molecule.
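
By way of illustration only, the growth-rate criteria given above for a strain in which a
gene has been inactivated (less than 20%, 10% or 5% of the wild-type growth rate) can be
sketched as a comparison of two measured growth rates. The function names, the returned
labels and the example rates are hypothetical; how the growth rates themselves are measured
is outside the scope of the sketch.

```python
def relative_growth_percent(strain_rate: float, wild_type_rate: float) -> float:
    """Growth rate of the test strain as a percentage of the wild-type rate."""
    if wild_type_rate <= 0:
        raise ValueError("wild-type growth rate must be positive")
    if strain_rate < 0:
        raise ValueError("growth rate cannot be negative")
    return 100.0 * strain_rate / wild_type_rate


def inactivation_tier(percent_of_wild_type: float) -> str:
    """Map relative growth onto the preferred ranges for an inactivated gene."""
    if percent_of_wild_type < 5:
        return "most preferred (<5%)"
    if percent_of_wild_type < 10:
        return "more preferred (<10%)"
    if percent_of_wild_type < 20:
        return "preferred (<20%)"
    return "outside preferred ranges"


# Hypothetical growth rates in doublings per hour.
print(inactivation_tier(relative_growth_percent(strain_rate=1.0, wild_type_rate=8.0)))
```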

As used herein, "target" refers to a biomolecule or complex of biomolecules that can be
acted on by an exogenous agent or compound, thereby modulating, preferably inhibiting, growth
or viability of a bacterial cell. A target may be a nucleic acid sequence or molecule, or a
polypeptide or a region of a polypeptide.

As used herein, the term "signal that is generated by activation or inhibition of a S.
aureus DnaI polypeptide" refers to the measurable indicator of DnaI activity in an assay of DnaI
activity. Examples include ³H-thymidine incorporation into DNA, plasmid replication, helicase
loading, or simply the signal resulting from binding of 77ORF104 to a DnaI polypeptide.

As used herein, the term "standard", used in reference to polypeptide activity, means the
amount of activity observed or detected (directly or indirectly) in a given assay performed in the
absence of a candidate compound. A "standard" serves as a reference to determine the effect,
positive or negative, of a candidate compound on polypeptide activity.

A "candidate compound" as used herein, is any compound with a potential to modulate
the expression or activity of a S. aureus DnaI polypeptide.

As used herein, the term "increase in activity" refers to an enhanced level of measurable
activity of a polypeptide in a given assay in the presence of a candidate compound relative to the
measurable level of activity in the absence of a candidate compound. Activity is considered
increased according to the invention if it is at least 10% greater, 20% greater, 50% greater, 75%
greater, 100% greater or more, up to 2-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100-fold or more
than in the absence of a candidate compound.

As used herein, the term "decrease in activity" refers to a reduced level of measurable
activity of a polypeptide in a given assay in the presence of a candidate compound relative to the
measurable level of activity in the absence of a candidate compound. Activity is considered
decreased according to the invention if it is at least 10% less, preferably 15% less, 20% less, 50%
less, 75% less, or even 100% less (i.e., no activity) than that observed in the absence of a
candidate compound.
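
By way of illustration only, the definitions of "increase in activity" and "decrease in
activity" can be applied by comparing the activity measured with a candidate compound to
the standard, that is, the activity measured without it. The sketch uses the minimum 10%
change for both directions; the function names, the returned labels and the example values
are hypothetical.

```python
def percent_change(standard: float, observed: float) -> float:
    """Change in activity relative to the standard, in percent."""
    if standard <= 0:
        raise ValueError("standard activity must be positive")
    return 100.0 * (observed - standard) / standard


def classify_activity(standard: float, observed: float,
                      threshold_percent: float = 10.0) -> str:
    """Label the effect of a candidate compound using the 10% minimum change."""
    change = percent_change(standard, observed)
    if change >= threshold_percent:
        return "increase"
    if change <= -threshold_percent:
        return "decrease"
    return "no change"


# Hypothetical assay readings in arbitrary units.
print(classify_activity(standard=100.0, observed=120.0))
print(classify_activity(standard=100.0, observed=0.0))
```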

As used herein, the term "conditions that permit their interaction", when used in
reference to a S. aureus DnaI polypeptide and a candidate compound, means that the two entities
are placed together, whether both in solution or with one immobilized or restricted in some way
and the other in solution, wherein the parameters (e.g., salt, detergent, protein or candidate
compound concentration, temperature, and redox potential, among others) of the solution are
such that the S. aureus DnaI polypeptide and the candidate compound may physically associate.
Conditions that permit protein:candidate interaction include, for example, the conditions
described herein for Phage display, Surface Plasmon Resonance and FRET assays.

As used herein, the term "detectable change in a measurable parameter of DnaI" refers to
an alteration in a quantifiable characteristic of a S. aureus DnaI polypeptide.

As used herein, the term "agonist" refers to an agent or compound that enhances or
increases the activity of a S. aureus DnaI polypeptide or polynucleotide. An agonist may be
directly active on a S. aureus DnaI polypeptide or polynucleotide, or it may be active on one or
more constituents in a pathway that leads to enhanced or increased activity of a S. aureus DnaI
polypeptide or polynucleotide.

As used herein, the term "antagonist" refers to an agent or compound that reduces or
decreases the activity of a S. aureus DnaI polypeptide or polynucleotide. An antagonist may be
directly active on a S. aureus DnaI polypeptide or polynucleotide, or it may be active on one or
more constituents in a pathway that leads to reduced or decreased activity of a S. aureus DnaI
polypeptide or polynucleotide.

As used herein, the term "antibacterial agent" or "antibacterial compound" refers to an
agent or compound that has a bactericidal or bacteriostatic effect on one or more bacterial
strains. Preferably, such an agent or compound is bactericidal or bacteriostatic on at least
S. aureus.

As used herein, the term "synthesizing" refers to a process of chemically synthesizing a 
compound. 

As used in the context of treating a bacterial infection, a "therapeutically effective
amount", "pharmaceutically effective amount" or "amount sufficient to provide a therapeutic
effect" indicates an amount of an antibacterial agent which has a therapeutic effect. This
generally refers to the inhibition, to some extent, of the normal cellular functioning of bacterial
cells required for continued bacterial infection. Further, as used herein, a therapeutically
effective amount means an amount of an antibacterial agent that produces the desired therapeutic
effect as judged by clinical trial results and/or animal models. This amount can be routinely
determined by one skilled in the art and will vary depending on several factors, such as the
particular bacterial strain involved and the particular antibacterial agent used. In the same
context, an "amount sufficient to reduce adhesion" of a bacterium to a tissue or tissue surface
indicates an amount of an antibacterial agent that is effective for prophylactically preventing or
reducing the extent of bacterial infection of the given tissue or tissue surface.

As used herein, a "tissue" refers to an aggregation of cells of one or more cell types
which together perform one or more specific functions in an organism. As used herein, a "tissue
surface" refers to that portion of a tissue that forms a boundary between a given tissue and other
tissues or the surroundings of the tissue. A tissue surface may refer to an external surface of an
animal, for example the skin or cornea, or, alternatively, the term may refer to a surface that is
either internal, for example, the lining of the gut, or to a surface that is exposed to the outside
surroundings of the animal only as the result of an injury or a surgical procedure.

As used herein, the term "measuring the binding of a candidate compound" refers to the
use of an assay permitting the quantitation of the amount of a candidate compound physically
associated with a S. aureus DnaI polypeptide.

As used herein, the term "directly or indirectly detectably labeled" refers to the
attachment of a moiety to a candidate compound that renders the candidate compound either
directly detectable (e.g., an isotope or a fluorophore) or indirectly detectable (e.g., an enzyme
activity, allowing detection in the presence of an appropriate substrate, or a specific antigen or
other marker allowing detection by addition of an antibody or other specific indicator).

As used herein, the term "small molecule" refers to compounds having molecular mass of 
less than 3000 Daltons, preferably less than 2000 or 1500, still more preferably less than 1000, 
and most preferably less than 600 Daltons. Preferably but not necessarily, a small molecule is 
not an oligopeptide. 

As used herein, the term "mimetic" refers to a compound that can be natural, synthetic, or 
chimeric and is structurally and functionally related to a reference compound. In terms of the 
present invention, a "peptidomimetic," for example, is a non-peptide compound that mimics the 
activity-related aspects of the 3-dimensional structure of a peptide or polypeptide, for example a 
compound that mimics the structure of a peptide or active portion of a phage- or bacterial 
ORF-encoded polypeptide. 

As used herein, the term "bacteriophage inhibitor protein" refers to a protein encoded by 
a bacteriophage nucleic acid sequence, which inhibits bacterial function in a host bacterium. 
Thus, it is a bacteria-inhibiting phage product. The term "bacteriophage inhibitor protein" 
encompasses a fragment, derivative, or active portion of a bacteriophage inhibitor protein. 

As used herein, the term "active portion" refers to an epitope, a catalytic or regulatory 
domain, or a fragment of a bacteriophage inhibitor protein that is responsible for, or a significant 
factor in, bacterial target inhibition. The active portion preferably may be removed from its 
contiguous sequences and, in isolation, still effect inhibition. 

As used herein, the term "treating a bacterial infection" refers to a process whereby the 
growth and/or metabolic activity of a bacterium or bacterial population in a host, preferably a 
mammal, more preferably a human, is inhibited or ablated. 

As used herein, the term "bacterium" refers to a single bacterial strain and includes a 
single cell and a plurality or population of cells of that strain unless clearly indicated to the 
contrary. In reference to bacteria or bacteriophage, the term "strain" refers to bacteria or phage 
having a particular genetic content. The genetic content includes genomic content as well as 
recombinant vectors. Thus, for example, two otherwise identical bacterial cells would represent 
different strains if each contained a vector, e.g., a plasmid, with different inserts. 

As used herein, the term "diagnosing" refers to the identification of an organism or strain 
of an organism responsible for a bacterial infection. 

As used herein, the term "infection with Staphylococcus aureus" refers to the presence, 
growth or proliferation of cells of a S. aureus strain within, or on a surface of, an animal, such as 
a mammal, preferably a human. 

As used herein, the term "bacteriophage 77 ORF 104-encoded polypeptide" refers to a 
polypeptide encoded by SEQ ID NO: 4 or to a fragment or derivative thereof encompassing an 
active portion of a bacteriophage 77 ORF 104-encoded polypeptide of sequence disclosed in 
SEQ ID NO: 5. 

As used herein, the term "DnaC" refers to a polypeptide of SEQ ID NO: 9, including that 
encoded by a polynucleotide of SEQ ID NO: 7, or to a fragment or derivative of such polypeptide 
encompassing an active portion of S. aureus DnaC. In this context, an active portion of S. 
aureus DnaC refers to that fragment or portion of S. aureus DnaC that interacts with or is part of 
a complex including S. aureus DnaI or a fragment or derivative of S. aureus DnaI. 

As used herein, the term "polypeptide complex" refers to a combination of two or more 
polypeptides in a physical association with each other. It is preferred that such a physical 
association be required for some aspect of the activity of one or more of the polypeptides in such 
a polypeptide complex. 

As used herein, the term "physical association" refers to an interaction between two 
moieties involving contact between the two moieties. 

As used herein, the term "bodily material(s)" means any material derived from an 
individual or from an organism infecting, infesting or inhabiting an individual, including but not 
limited to, cells, tissues and waste, such as bone, blood, serum, cerebrospinal fluid, semen, 
saliva, muscle, cartilage, organ tissue, skin, urine, stool or autopsy materials. 

As used herein, the term "disease(s)" means any disease caused by or related to infection 
by a bacterium, including, for example, otitis media, conjunctivitis, pneumonia, bacteremia, 
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis, such 
as for example infection of cerebrospinal fluid. 

As used herein, the term "fusion protein(s)" refers to a protein encoded by a gene 
comprising amino acid coding sequences from two or more separate proteins fused in frame such 
that the protein comprises fused amino acid sequences from the separate proteins. 

As used herein, the term "host cell(s)" refers to a cell which has been transformed or 
transfected, or is capable of transformation or transfection by an exogenous polynucleotide 
sequence. 

As used herein, the term "immunologically equivalent derivative(s)" encompasses a 
polypeptide, polynucleotide, or the equivalent of either which, when used in a suitable 
formulation to raise antibodies in a vertebrate, results in antibodies that act to interfere with the 
immediate physical interaction between pathogen and mammalian host. 

As used herein, the term "immunospecific" means that characteristic of an antibody 
whereby it possesses substantially greater affinity for the polypeptides of the invention or the 
polynucleotides of the invention than its affinity for other related polypeptides or polynucleotides 
respectively, particularly those polypeptides and polynucleotides in the prior art. 

As used herein, the term "individual(s)" means a multicellular eukaryote, including, but 
not limited to a metazoan, a mammal, an ovid, a bovid, a simian, a primate, and a human. 

As used herein, the term "organism(s)" means (i) a prokaryote, including but not limited 
to, a member of the genus Streptococcus, Staphylococcus, Bordetella, Corynebacterium, 
Mycobacterium, Neisseria, Haemophilus, Actinomycetes, Streptomycetes, Nocardia, 
Enterobacter, Yersinia, Francisella, Pasteurella, Moraxella, Acinetobacter, Erysipelothrix, 
Branhamella, Actinobacillus, Streptobacillus, Listeria, Calymmatobacterium, Brucella, Bacillus, 
Clostridium, Treponema, Escherichia, Salmonella, Klebsiella, Vibrio, Proteus, Erwinia, 
Borrelia, Leptospira, Spirillum, Campylobacter, Shigella, Legionella, Pseudomonas, Aeromonas, 
Rickettsia, Chlamydia, Borrelia and Mycoplasma, and further including, but not limited to, a 
member of the species or group, Group A Streptococcus, Group B Streptococcus, Group C 
Streptococcus, Group D Streptococcus, Group G Streptococcus, Streptococcus pneumoniae, 
Streptococcus pyogenes, Streptococcus agalactiae, Streptococcus faecalis, Streptococcus 
faecium, Streptococcus durans, Neisseria gonorrhoeae, Neisseria meningitidis, Staphylococcus 
aureus, Staphylococcus epidermidis, Corynebacterium diphtheriae, Gardnerella vaginalis, 
Mycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium ulcerans, Mycobacterium 
leprae, Actinomyces israelii, Listeria monocytogenes, Bordetella pertussis, Bordetella 
parapertussis, Bordetella bronchiseptica, Escherichia coli, Shigella dysenteriae, Haemophilus 
influenzae, Haemophilus aegyptius, Haemophilus parainfluenzae, Haemophilus ducreyi, 
Bordetella, Salmonella typhi, Citrobacter freundii, Proteus mirabilis, Proteus vulgaris, Yersinia 
pestis, Klebsiella pneumoniae, Serratia marcescens, Serratia liquefaciens, Vibrio cholerae, 
Shigella dysenteriae, Shigella flexneri, Pseudomonas aeruginosa, Francisella tularensis, Brucella 
abortus, Bacillus anthracis, Bacillus cereus, Clostridium perfringens, Clostridium tetani, 
Clostridium botulinum, Treponema pallidum, Rickettsia rickettsii and Chlamydia trachomatis, 
(ii) an archaeon, including but not limited to Archaebacter, and (iii) a unicellular or filamentous 
eukaryote, including but not limited to, a protozoan, a fungus, a member of the genus 
Saccharomyces, Kluyveromyces, or Candida, and a member of the species Saccharomyces 
cerevisiae, Kluyveromyces lactis, or Candida albicans. 

As used herein, the term "recombinant expression system(s)" refers to a system in which 
vectors comprising sequences encoding polypeptides of the invention or portions thereof, or 
polynucleotides of the invention are introduced or transformed into a host cell or host cell lysate 
for the production of the polynucleotides and polypeptides of the invention. 

As used herein, the term "artificially synthesized" when used in reference to a peptide, 
polypeptide or polynucleotide means that the amino acid or nucleotide subunits were chemically 
joined in vitro without the use of cells or polymerizing enzymes. The chemistry of 
polynucleotide and peptide synthesis is well known in the art. 

In addition to the standard single and triple letter representations for amino acids, the 
term "X" or "Xaa" may also be used in describing certain polypeptides of the invention. "X" and 
"Xaa" mean that any of the twenty naturally occurring amino acids may appear at such a 
designated position in the polypeptide sequence. 

As used herein, the term "specifically binding" in the context of the interaction of two 
polypeptides means that the two polypeptides physically interact via discrete regions or domains 
on the polypeptides, wherein the interaction is dependent upon the amino acid sequences of the 
interacting domains. Generally, the equilibrium binding concentration of a polypeptide that 
specifically binds another is in the range of about 1 µM or lower, preferably 100 nM or lower, 10 
nM or lower, 1 nM or lower, 100 pM or lower, and even 10 pM or lower. 
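
The ranges listed above can be read as a series of successively tighter cutoffs. The following is a minimal sketch only, assuming that the "equilibrium binding concentration" is handled as an equilibrium dissociation constant in molar units; the names KD_TIERS_M and binding_tier are hypothetical and are not part of the disclosure.

```python
# Cutoffs taken from the definition above, in mol/L, from least to most stringent.
# Treating the "equilibrium binding concentration" as a dissociation constant
# is an assumption made for this illustration.
KD_TIERS_M = [
    ("1 uM or lower", 1e-6),
    ("100 nM or lower", 1e-7),
    ("10 nM or lower", 1e-8),
    ("1 nM or lower", 1e-9),
    ("100 pM or lower", 1e-10),
    ("10 pM or lower", 1e-11),
]


def binding_tier(kd_molar):
    """Return the label of the most stringent cutoff met, or None if none is met."""
    if kd_molar <= 0:
        raise ValueError("the binding constant must be positive")
    best = None
    for label, limit in KD_TIERS_M:
        if kd_molar <= limit:
            best = label
    return best
```

Under these assumptions, a value of 50 nM falls in the "100 nM or lower" tier, while a value of 2 µM meets none of the listed cutoffs.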

As used herein, the term "decrease in the binding" refers to a drop in the signal that is 
generated by the physical association between two polypeptides under one set of conditions 
relative to the signal under another set of reference conditions. The signal is decreased if it is at 
least 10% lower than the level under reference conditions, and preferably 20%, 40%, 50%, 75%, 
90%, 95% or even as much as 100% lower (i.e., no detectable interaction). 
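
The threshold in this definition amounts to a simple percentage calculation. The sketch below is illustrative only; the function names, and the assumption that both signals are reported in the same arbitrary units with a positive reference signal, are not taken from the disclosure.

```python
def percent_decrease(reference_signal, test_signal):
    """Return how much lower the test signal is than the reference, in percent."""
    if reference_signal <= 0:
        raise ValueError("the reference signal must be positive")
    return 100.0 * (reference_signal - test_signal) / reference_signal


def is_decreased(reference_signal, test_signal, threshold_percent=10.0):
    """True if the test signal is at least threshold_percent lower than the reference."""
    return percent_decrease(reference_signal, test_signal) >= threshold_percent
```

The default threshold corresponds to the 10% minimum stated above; the preferred levels (20%, 40%, 50%, 75%, 90%, 95%) can be checked by passing a different threshold_percent. The same arithmetic applies when the quantity compared is the expression of a detectable marker rather than a binding signal.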

As used herein, the term "detectable marker", when used in the context of a yeast 
two-hybrid assay, refers to a polypeptide that confers a trait upon a cell expressing that polypeptide 
that signals the presence or amount of that polypeptide expressed. Detectable markers are 
encoded on plasmids that may exist episomally or may be integrated into the genome of a host 
cell. Detectable markers include, but are not limited to, polypeptides encoding enzymes 
allowing colorimetric or fluorescent detection (e.g., E. coli LacZ, which catalyzes the conversion 
of the substrate analog X-gal to generate a blue color), polypeptides encoding enzymes 
conferring antibiotic resistance, and polypeptides encoding enzymes conferring the ability of a 
yeast strain to grow on medium lacking a given component (i.e., critical for the relief of 
auxotrophy). 

As used herein, the term "results in the expression of a detectable marker" means that the 
interaction of factors necessary to permit the expression of a detectable marker (e.g., two-hybrid 
transactivation domain and DNA binding domain fusion proteins) causes the transactivation and 
translation of detectable levels of a detectable marker. A "detectable level" is that level of 
expression that can be differentiated from background expression occurring in the substantial 
absence of one or more factors or conditions necessary for marker expression. Detectable levels 
will vary depending upon the nature of the detectable marker, but will generally consist of levels 
at least about 10% or more greater than the background level of a given marker. 

As used herein, the term "decrease in the expression" refers to a drop in the expression of 
a detectable marker under one set of conditions relative to the expression under another set of 
reference conditions. The expression of a detectable marker is decreased if it is at least 10% 
lower than the level under reference conditions, and preferably 20%, 40%, 50%, 75%, 90%, 95% 
or even as much as 100% lower (i.e., not expressed). 

How to Identify a S. aureus dnaI sequence: 

Using methodology described in detail in Examples 1 and 2, a S. aureus polypeptide that 
specifically bound the bacterial growth inhibitory 77 phage ORF 104 protein was isolated. 
Briefly, the 77 ORF 104 protein was used as a ligand in an affinity chromatography binding step 
with S. aureus protein extract. The selected S. aureus interacting polypeptide was purified and 
further analyzed by tryptic digestion and mass spectrometry. The sequence of a tryptic peptide of 
the S. aureus polypeptide, GHVPENVTDNDR (SEQ ID NO: 10), was used to BLAST search 
the S. aureus nucleotide sequence in the University of Oklahoma S. aureus genomic database at 
http://www.genome.ou.edu/staph.html. One sequence contig of 4850 nucleotides in length 
(Contig 981), when converted into amino acid sequence, contained within it the similar amino 
acid sequence GHVPELYVDNNR (SEQ ID NO: 11; Fig. 5). This tentative identification of the 
candidate protein was then confirmed upon in silico tryptic digestion of the open reading frame 
found in the contig (Fig. 5). The obtained PSD/CID spectra for tryptic peptides with 
monoisotopic MH+ masses of 1351.8, 1412.7, and 1617.8 Da were similar to the predicted 
PSD/CID fragmentation patterns of the tryptic peptides with monoisotopic MH+ masses of 
1351.8 and 1617.8 Da found in the contig's +3 open reading frame (Fig. 5). 
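
The in silico tryptic digestion and mass comparison described above can be sketched as follows. This is an illustrative outline rather than the procedure actually used: it assumes the common rule that trypsin cleaves on the carboxyl side of K or R except when the next residue is P, it uses standard monoisotopic residue masses, and the function names are hypothetical.

```python
# Standard monoisotopic residue masses (Da) for the twenty common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # added once per peptide for the free termini
PROTON = 1.00728   # added for the singly protonated (MH+) ion


def tryptic_digest(protein):
    """Split a protein after each K or R, unless the following residue is P."""
    peptides = []
    start = 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def mh_plus(peptide):
    """Monoisotopic mass of the singly protonated peptide, in Da."""
    return sum(RESIDUE_MASS[residue] for residue in peptide) + WATER + PROTON
```

Under these assumptions, mh_plus("GHVPELYVDNNR") evaluates to roughly 1412.69, which is in line with the 1412.7 Da value quoted above. Predicted masses for every peptide returned by tryptic_digest can be compared with the observed MH+ values in the same way.
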

Comparison of the ORF of the S. aureus contig that encodes a tryptic peptide similar to 
that identified in the S. aureus phage 77 ORF 104 binding studies with all other sequences in the 
public domain databases revealed that the ORF is related to the DnaI protein from Bacillus 
subtilis (Table 1), a protein implicated in chromosome replication. No other significant similarity 
was found with any other protein in publicly accessible databases. The identified ORF shows 
41% identity and 63% similarity to the B. subtilis DnaI protein (Table 1). 

Many genes of B. subtilis involved in DNA replication have been identified through the 
isolation of thermosensitive mutants. One of these, dnaI2, affected an unknown step of 
chromosome replication at the restrictive temperature (Karamata, D. and Gross, J.D. (1970) Mol. 
Gen. Genet. 108, 277-287). The gene was mapped around 250° on the B. subtilis chromosome 
and resides immediately downstream of the dnaB gene on the B. subtilis chromosome (Bruand, 
C. and Ehrlich, S.D. (1995) Microbiology 141, 1199-1200). The dnaI2 mutation has been 
characterized and resides within the dnaI gene and consists of a G to A substitution at nucleotide 
position 922 (Fig. 1; SEQ ID NO: 1) resulting in a glycine to glutamate change at position 307 
(Fig. 1; SEQ ID NO: 2) (Bruand, C. and Ehrlich, S.D. (1995) Microbiology 141, 1199-1200). 

DnaC has been genetically identified to be the major component DNA helicase of 
chromosome replication (Sakamoto, Y., Nakai, S., Moriya, S., Yoshikawa, H., and Ogasawara, 
N. (1995) Microbiology 141, 641-644) and is thought to unwind duplex DNA progressively and 
allow for binding of the DNA polymerase III holoenzyme necessary for priming and DNA 
synthesis. One possible function of DnaI is as a helicase loader, being responsible for 
transferring DnaC helicase to the oriC. The products of the dnaC and dnaI genes are required for 
chromosome replication and are essential for DNA replication in B. subtilis (Ceglowski, P., 
Lurz, R., Alonso, J.C. (1993) J. Mol. Biol. 236, 1324-1340). 

Databases were searched for S. aureus genes which may be related to the B. subtilis dnaC 
gene. Utilizing the B. subtilis amino acid sequence for DnaC (Accession Number P37469), a 
BLAST search was performed of the Staphylococcus database at http://www.tigr.org and 
revealed the presence of an ORF within the S. aureus genome encoding a related protein. The 
nucleotide sequence and corresponding protein sequence are presented in Fig. 6A (SEQ ID NO: 
7) and Fig. 6B (SEQ ID NO: 9), respectively. 

Identification of the surface of interaction on DnaI 

This invention relates, in part, to a specific interaction between a growth-inhibitory 
protein encoded by the Staphylococcus aureus bacteriophage 77 genome and an essential S. 
aureus protein. This interaction forms the basis for drug screening assays. More specifically, the 
invention relates to the interacting regions of the protein encoded by the S. aureus bacteriophage 
77 and the S. aureus DnaI proteins, forming the basis for screening assays. The invention 
provides a method for the identification of DnaI polypeptide fragments that are involved in said 
interaction between DnaI and ORF 104 from bacteriophage 77. Several approaches and 
techniques known to those skilled in the art can be used to identify and to characterize fragments 
of the DnaI interacting with 77 ORF 104. These fragments may include, for example, truncation 
polypeptides having a portion of an amino acid sequence of the proteins, or variants thereof, such 
as a continuous series of residues that includes an amino- and/or carboxyl-terminal amino acid 
sequence for DnaI. 

A) Affinity chromatography 

Partial proteolysis of proteins in solution is one method to delineate the domain 
boundaries in multi-domain proteins. By subjecting proteins to limited digestion, the most 
accessible cleavage sites are preferentially hydrolyzed. These cleavage sites preferentially reside 
in less structured regions which include loops and highly mobile areas typical of the joining 
amino acids between highly structured domains. For this analysis, a purified recombinant DnaI 
polypeptide (including a fragment of DnaI either purified from a previous protease digestion or 
expressed from a recombinant nucleic acid vector as a fragment) can be subjected to partial 
proteolysis. The proteolysis can be performed with low concentrations of proteases, including, 
but not limited to trypsin, chymotrypsin, endoproteinase Glu-C, and Asp-N with a DnaI 
polypeptide in solution, resulting in the generation of defined proteolytic products as observed by 
SDS-PAGE. An acceptable concentration and reaction time is defined by the near complete 
conversion of the full-length protein to stable proteolytic products. The partial proteolytic 
fragments are then subjected to affinity chromatography with immobilized 77 ORF 104 to 
determine the region of the DnaI polypeptide containing the 77 ORF 104 binding site. Interacting 
domains are identified by mass spectrometry to determine the masses of both the intact fragment 
and the series of fragments from a tryptic digest to identify the amino acid residues contained 
within the partial proteolytic fragment. Using both sets of data, the amino acid sequence of the 
partial proteolytic fragment can be precisely determined. 
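
The way the two sets of mass data narrow down a fragment can be illustrated with a short sketch. It is a simplified outline under stated assumptions: monoisotopic masses are used for the intact fragment (average masses are often used in practice for larger species), the mass tolerance is arbitrary, and the function names and the example sequence in the note that follows are hypothetical.

```python
# Standard monoisotopic residue masses (Da) for the twenty common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056


def neutral_mass(sequence):
    """Monoisotopic mass of an uncharged polypeptide, in Da."""
    return sum(RESIDUE_MASS[residue] for residue in sequence) + WATER


def candidate_fragments(protein, observed_mass, tolerance=0.5):
    """List (start, end, sequence) for each contiguous stretch of the protein
    whose neutral mass lies within tolerance Da of the observed intact mass.
    Positions are 1-based and inclusive."""
    # prefix[i] holds the summed residue mass of protein[:i]
    prefix = [0.0]
    for residue in protein:
        prefix.append(prefix[-1] + RESIDUE_MASS[residue])
    hits = []
    for i in range(len(protein)):
        for j in range(i + 1, len(protein) + 1):
            mass = prefix[j] - prefix[i] + WATER
            if abs(mass - observed_mass) <= tolerance:
                hits.append((i + 1, j, protein[i:j]))
    return hits


def filter_by_peptides(hits, identified_peptides):
    """Keep only candidates that contain every tryptic peptide identified."""
    return [hit for hit in hits
            if all(peptide in hit[2] for peptide in identified_peptides)]
```

For a made-up ten-residue sequence such as "MKTAYIAKQR", searching with the neutral mass of "TAYIAK" returns the stretch spanning residues 3 to 8. The intact mass alone may match more than one stretch of a real protein, which is why the tryptic peptide data are used as a second filter.
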

B) Yeast two-hybrid analysis 

The interaction between 77 ORF 104 and portions of the DnaI polypeptide can also be 
assessed in vivo using the yeast two hybrid system. To do this, bacteriophage 77 ORF 104 is 
fused to the DNA binding domain of the yeast transcriptional transactivator Gal4, and different 
portions of the DnaI polypeptide are fused to the carboxyl terminus of the Gal4 activation 
domain. The two plasmids bearing such constructs can be introduced sequentially, or in 
combination, into a yeast cell line, for example AH109 (Clontech Laboratories), previously 
engineered to contain chromosomally-integrated copies of E. coli lacZ and the selectable HIS3 
and ADE2 genes. The lacZ, HIS3, and ADE2 reporter genes, each driven by a promoter containing 
Gal4 binding sites, are used for measuring protein-protein interactions. If the two recombinant 
proteins interact within the yeast cell, the resulting protein:protein complex activates 
transcription from promoters containing Gal4 binding sites. Expression of the HIS3 and ADE2 
genes is manifested by relief of histidine and adenine auxotrophy. As described in the examples 
below, full length DnaI, as well as DnaI fragments, was found to interact with bacteriophage 77 
ORF 104 fusion polypeptides using this system. 

Further elucidation of the bacteriophage 77 ORF 104 interacting domain of DnaI can be 
carried out by first subjecting the full length DnaI polypeptide to deletional mutagenesis, the 
methods of which are known to those of skill in the art. The mutated DnaI polypeptides can then 
be subjected to yeast two hybrid analysis as described above, to further narrow those amino acid 
sequences or polypeptide fragments, for example, those within SEQ ID NO: 16, that are required 
for the binding of DnaI to bacteriophage 77 ORF 104. 

S. aureus DnaI polypeptides 

In one aspect of the invention there are provided polypeptides of S. aureus referred to 
herein as "DnaI" and "DnaI polypeptides" as well as biologically, diagnostically, 
prophylactically, clinically or therapeutically useful variants thereof, and compositions 
comprising the same. 

Among the particularly preferred embodiments of the invention are variants of S. aureus 
DnaI polypeptides encoded by naturally occurring alleles of the dnaI gene. The present invention 
provides for an isolated polypeptide which comprises or consists of: (a) an amino acid sequence 
which has at least 50% identity, preferably at least 80% identity, more preferably at least 90%, 
yet more preferably at least 95%, most preferably at least 97-99% or exact identity, to that of 
SEQ ID NO: 2 over the entire length of SEQ ID NO: 2 or (b) an amino acid sequence that has at 
least 70% similarity, at least 80% similarity, at least 90% similarity, at least 95% similarity, at 
least 97-99% similarity or even 100% similarity over the entire length of SEQ ID NO: 2. 
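
As an illustration of how a percent identity "over the entire length" of a reference sequence might be computed from an existing pairwise alignment, consider the sketch below. It reflects one possible convention (identical positions divided by the number of reference residues); other conventions divide by the alignment length, and the disclosure does not specify which is intended. The function name is hypothetical. Percent similarity would additionally require a substitution matrix or a table of conservative replacements, which is not sketched here.

```python
def percent_identity(aligned_reference, aligned_query, gap="-"):
    """Percent of reference residues that are matched by an identical residue
    in the query, given two rows of a pairwise alignment of equal length."""
    if len(aligned_reference) != len(aligned_query):
        raise ValueError("aligned sequences must have the same length")
    reference_length = sum(1 for r in aligned_reference if r != gap)
    if reference_length == 0:
        raise ValueError("the reference row contains no residues")
    matches = sum(1 for r, q in zip(aligned_reference, aligned_query)
                  if r != gap and r == q)
    return 100.0 * matches / reference_length
```

Applied to the two ungapped twelve-residue tryptic peptide sequences given earlier in this section, percent_identity("GHVPENVTDNDR", "GHVPELYVDNNR") counts eight identical positions out of twelve, or about 66.7%.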

The polypeptides of the invention include a polypeptide of Fig. 1 (SEQ ID NO: 2) (in 
particular the mature polypeptide) as well as polypeptides and fragments, particularly those 
which have the biological activity of DnaI, and also those which have at least 50% identity over 
20, 40, 50 or more amino acids to a polypeptide of SEQ ID NO: 2 or the relevant portion, 
preferably at least 60%, 70%, or 80% identity, more preferably at least 90% identity to a 
polypeptide of SEQ ID NO: 2 and still more preferably at least 95% identity to a polypeptide of 
SEQ ID NO: 2 and yet still more preferably at least 99% identity to a polypeptide of SEQ ID NO: 2. 

The polypeptides of the invention also include a polypeptide or protein fragment that has 
at least 60%, 70%, 80% or 90% similarity, 95% similarity or even 97-99% similarity over 10, 20, 
25, 30 or more amino acids to a polypeptide of SEQ ID NO: 2. It is preferred that a polypeptide 
of the invention has at least 60% similarity to a polypeptide of SEQ ID NO: 2 over at least 20 
amino acids. 

It is most preferred that a polypeptide of the invention is derived from S. aureus; 
however, it may be obtained from other organisms of the same taxonomic genus. A polypeptide 
of the invention may also be obtained, for example, from organisms of the same taxonomic 
family or order. 

Fragments of DnaI also are included in the invention. These fragments may include, for 
example, truncation polypeptides having a portion of an amino acid sequence of Fig. 1 (SEQ ID 
NO: 2), or variants thereof, such as a continuous series of residues that includes an amino- and/or 
carboxyl-terminal amino acid sequence. Degradation forms of the polypeptides of the invention 
produced by or in a host cell, particularly S. aureus, are also preferred. Further preferred are 
fragments characterized by structural or functional attributes such as fragments that comprise 
alpha-helix and alpha-helix-forming regions, beta-sheet and beta-sheet-forming regions, turn and 
turn-forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, 
alpha amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, 
substrate binding region, and high antigenic index regions. Fragments of DnaI may be expressed 
as fusion proteins with other proteins or protein fragments. 

Preferred fragments also include an isolated polypeptide comprising an amino acid 
sequence having at least 20, 30, 40, 50, or 100 contiguous amino acids from the amino acid 
sequence of SEQ ID NO: 2. 

Also preferred are biologically "active" fragments which are those fragments that 
mediate activities of S. aureus DnaI, including those with a similar activity or an improved 
activity, or with a decreased undesirable activity. Also included are those fragments that are 
antigenic or immunogenic in an animal, especially in a human. Particularly preferred are 
fragments comprising domains that confer a function essential for viability of S. aureus. 

Fragments of the polypeptides of the invention may be employed for producing the 
corresponding full-length polypeptide by peptide synthesis; therefore, these variants may be 
employed as intermediates for producing the full-length polypeptides of the invention. 

S. aureus polynucleotides 

It is an object of the invention to provide polynucleotides that encode DnaI polypeptides, 
particularly polynucleotides that encode the polypeptide herein designated S. aureus DnaI. 

In one aspect of the invention a polynucleotide is provided that comprises a region 
encoding a S. aureus DnaI polypeptide, the polynucleotide comprising a sequence set out in 
SEQ ID NO: 1. Such a polynucleotide encodes a full length DnaI gene, or a variant thereof. It is 
contemplated that this full-length gene is essential to the growth and/or survival of an organism 
which possesses it, such as S. aureus. 

As a further aspect of the invention there are provided isolated nucleic acid molecules 
encoding and/or expressing a fragment of a full-length DnaI polypeptide, particularly a S. aureus 
DnaI polypeptide or a variant thereof. Further embodiments of the invention include 
biologically, diagnostically, prophylactically, clinically or therapeutically useful polynucleotides 
and polypeptides, and variants thereof, and compositions comprising the same. 

A polynucleotide of the invention is obtained using S. aureus cells as starting material, 
the nucleotide sequence information disclosed in SEQ ID NO: 1, and standard cloning and 
screening methods, such as those for cloning and sequencing chromosomal DNA fragments from 
bacteria. For example, to obtain a polynucleotide sequence of the invention, such as the 
polynucleotide sequence disclosed in SEQ ID NO: 1, a library of clones of chromosomal 
DNA of S. aureus in E. coli or another suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 17-mer or longer, derived from a partial sequence. Clones carrying 
DNA identical to that of the probe can be distinguished using stringent hybridization conditions. 
As herein used, the terms "stringent conditions" and "stringent hybridization conditions" mean 
hybridization occurring only if there is at least 95% and preferably at least 97% identity between 
the sequences. A specific example of stringent hybridization conditions is an overnight 
incubation of a hybridization support (e.g., a nylon or nitrocellulose membrane) at 42°C in a 
solution comprising: 1 x 10^ cpm/ml labeled probe, 50% formamide, 5x SSC (150 mM NaCl, 
15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 micrograms/ml of denatured, sheared salmon sperm DNA, followed by 
washing the hybridization support in 0.1x SSC at about 65°C. Hybridization and wash conditions 
are well known to those skilled in the art and are exemplified in Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), particularly 
Chapter 11 therein. Solution hybridization may also be used with the polynucleotide sequences 
provided by the invention. By sequencing the individual clones thus identified by hybridization, 
it is possible to confirm the identity of the clone. 

Alternatively, an amplification process can be utilized to isolate the polynucleotide. In 
this approach, the sequence disclosed as SEQ ID NO: 1 is targeted by two oligonucleotides, one 
identical to a sequence on the coding DNA strand at or upstream of the ATG initiation codon and 
the other which anneals to the opposite strand at or downstream of the stop codon. Priming from 
these oligonucleotides in a polymerase chain reaction yields a full length gene coding sequence. 
Such suitable techniques are described by Maniatis, T., Fritsch, E.F. and Sambrook, J., 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989). 
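
The primer arrangement described above can be sketched in a few lines. This is an illustration under simplifying assumptions: the primers are placed exactly at the initiation and stop codons (the text also allows positions upstream and downstream), the input is the coding strand written 5' to 3', the default primer length of 20 is arbitrary, and the function names are hypothetical. Melting temperature, secondary structure and added cloning sites are not considered.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = ("TAA", "TAG", "TGA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA sequence."""
    return dna.translate(COMPLEMENT)[::-1]


def design_primers(coding_strand, primer_length=20):
    """Return (forward, reverse) primers, each written 5' to 3'.

    The forward primer is identical to the coding strand beginning at the
    ATG initiation codon. The reverse primer is the reverse complement of
    the last bases of the coding strand, ending at the stop codon."""
    sequence = coding_strand.upper()
    if len(sequence) < primer_length:
        raise ValueError("sequence is shorter than the requested primer length")
    if not sequence.startswith("ATG"):
        raise ValueError("coding sequence is expected to start with ATG")
    if sequence[-3:] not in STOP_CODONS:
        raise ValueError("coding sequence is expected to end with a stop codon")
    forward = sequence[:primer_length]
    reverse = reverse_complement(sequence[-primer_length:])
    return forward, reverse
```

For a made-up 18-nucleotide coding sequence "ATGAAACCCGGGTTTTAA" and a primer length of 6, the sketch returns "ATGAAA" as the forward primer and "TTAAAA" as the reverse primer.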

In a further aspect, the present invention provides for an isolated polynucleotide 
comprising or consisting of: (a) a polynucleotide sequence which has at least 60% identity, 
preferably at least 70% identity, more preferably at least 80% identity, more preferably at least 
90% identity, yet more preferably at least 95%, most preferably at least 97-99% or exact identity, 
to that of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1; (b) a polynucleotide sequence 
encoding a polypeptide which has at least 50% identity, preferably at least 60% identity, more 
preferably at least 70% identity, more preferably at least 80% identity, more preferably at least 
90%, yet more preferably at least 95%, most preferably at least 97-99% or exact identity to SEQ 
ID NO: 2 over the entire length of SEQ ID NO: 2; or the complement of a sequence of (a) or (b) 
above. 

The invention provides a polynucleotide sequence identical over its entire length to the 
coding sequence of SEQ ID NO: 1. Also provided by the invention is a coding sequence for a 
mature polypeptide or a fragment thereof (including, for example, a fragment encoding a 
polypeptide of SEQ ID NO: 16), by itself as well as a coding sequence for a mature polypeptide 
or a fragment in reading frame with another coding sequence, such as a sequence encoding a 
leader or secretory sequence, a pre-, or pro-, or prepro-protein sequence. The polynucleotide of 
the invention may also contain at least one non-coding sequence, including for example, but not 
limited to at least one non-coding 5' and 3' sequence, such as the transcribed but non-translated 
sequences, termination signals (such as rho-dependent and rho-independent termination signals), 
ribosome binding sites, Kozak sequences, sequences that stabilize or destabilize mRNAs, 
introns, and polyadenylation signals. The polynucleotide sequence may also comprise additional 
coding sequence encoding additional amino acids. For example, a marker sequence that 
facilitates purification of the fused polypeptide can be encoded. In certain embodiments of the 
invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE vector 
(Qiagen, Inc.) and described in Gentz et al., Proc. Natl. Acad. Sci. 86: 821-824 (1989), or an HA 
peptide tag (Wilson et al., Cell 37: 767 (1984)), both of which may be useful in purifying 
polypeptide sequences fused to them. Polynucleotides of the invention also include, but are not 
limited to, polynucleotides comprising a structural gene and its naturally associated sequences 
that control gene expression. 

It is most preferred that a polynucleotide of the invention is derived from Staphylococcus 
aureus; however, it may also be obtained from other organisms of the same taxonomic genus. A 
polynucleotide of the invention may also be obtained, for example, from organisms of the same 
taxonomic family or order. 

Further preferred embodiments are polynucleotides encoding S. aureus dnal variants that 
have the amino acid sequence of S. aureus Dnal polypeptide of SEQ ID NO: 2 in which several, 
a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, modified, deleted 
and/or added, in any combination. Especially preferred among these polynucleotides are those 
encoding silent nucleotide alterations that do not alter the coding sequence or activities of the S. 
aureus Dnal polypeptides they encode. 

Preferred embodiments are polynucleotides encoding polypeptides that retain 
substantially the same biological function or activity as the mature polypeptide encoded by a 
DNA of SEQ ID NO: 1. 

In accordance with certain preferred embodiments of this invention there are provided 
polynucleotides that hybridize, particularly under stringent conditions, to S. aureus dnal 
polynucleotide sequences, such as those polynucleotides in Fig. 1. 

The polynucleotides of the invention are useful as hybridization probes for RNA, cDNA 
and genomic DNA to isolate full-length cDNAs and genomic clones encoding genes that have a 
high degree of sequence identity to the dnal gene. Such probes generally will comprise at least 
15 to about 100 residues or base pairs, although such probes will preferably have about 20 to 50 
nucleotide residues or base pairs. Particularly preferred probes are about 20 to about 30 
nucleotide residues or base pairs in length. 
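
The probe lengths described above can be illustrated with a minimal sketch that tiles a target sequence into candidate probes and returns each probe as the reverse complement of its target window, so that it would hybridize to the strand shown. The 25-nucleotide default falls within the "about 20 to about 30" range given in the text; the step size, the function names and the toy sequence are assumptions for illustration only, and no selection for melting temperature or specificity is attempted.

```python
# Translation table for DNA complementation (unambiguous bases only).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(seq: str, length: int = 25, step: int = 5):
    """Tile `seq` into windows of `length` nt and return (start, probe) pairs.

    Each probe is the reverse complement of its window, i.e. complementary
    to the target strand passed in. Starts are 0-based.
    """
    seq = seq.upper()
    return [
        (start, reverse_complement(seq[start:start + length]))
        for start in range(0, len(seq) - length + 1, step)
    ]


# Toy 40-nt target (hypothetical, not SEQ ID NO: 1).
target = "ATGGCTAAACGTTTAGGCGATCCATTGACCGGTAAAGCTT"
probes = candidate_probes(target)
```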

A coding region of a related dnal gene from a bacterial species other than S. aureus may 
be isolated by screening a library using a DNA sequence provided in SEQ ID NO: 1 to 
synthesize an oligonucleotide probe. A labeled oligonucleotide having a sequence 
complementary to that of a gene of the invention is then used to screen a library of cDNA, 
genomic DNA or mRNA to determine to which member(s) of the library the probe hybridizes. 
There are several methods available and well known to those skilled in the art to obtain 
full-length DNAs, or extend short DNAs, for example those based on the method of Rapid 
Amplification of cDNA ends (RACE) (see, for example, Frohman, et al., PNAS USA 85: 8998- 
9002, 1988). Recent modifications of the technique, exemplified by the MARATHON(TM) 
technology (Clontech Laboratories Inc.) for example, have significantly simplified the search for 
longer cDNAs. In the MARATHON(TM) technology, cDNAs are prepared from mRNA 
extracted from a chosen cell and an 'adaptor' sequence is ligated onto each end. Nucleic acid 
amplification by PCR is then carried out to amplify the "missing" 5' end of the DNA using a 
combination of gene specific and adaptor specific oligonucleotide primers. The PCR reaction is 
then repeated using "nested" primers, that is, primers designed to anneal within the amplified 
product (typically an adaptor-specific primer that anneals further 3' in the adaptor sequence and a 
gene-specific primer that anneals further 5' in the selected gene sequence). The products of this 
reaction can then be analyzed by DNA sequencing and a full-length DNA constructed either by 
joining the product directly to the existing DNA to give a complete sequence, or by carrying out 
a separate full-length PCR using the new sequence information for the design of the 5' primer. 

The polynucleotides and polypeptides of the invention may be employed, for example, as 
research reagents and materials for discovery of treatments of and diagnostics for diseases, 
particularly human diseases, as further discussed herein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived from a sequence of 
SEQ ID NO: 1 are useful for the design of PCR primers in reactions to determine whether or not 
the polynucleotides identified herein in whole or in part are transcribed in bacteria in infected 
tissue. That is, the polynucleotides of the invention are useful for diagnosis of infection with a 
bacterial strain carrying those sequences. It is recognized that such sequences also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. 

The invention also provides polynucleotides that encode a polypeptide that is the mature 
protein plus additional amino or carboxyl-terminal amino acids, or amino acids interior to the 
mature polypeptide. Such sequences may play a role in processing of a protein from precursor to 
a mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
facilitate manipulation of a protein for assay or production, among other things. As generally is 
the case in vivo, the additional amino acids may be processed away from the mature protein by 
cellular enzymes. 

A precursor protein, having a mature form of the polypeptide fused to one or more 
prosequences, may be an inactive form of the polypeptide. When prosequences are removed such 
inactive precursors generally are activated. Some or all of the prosequences may be removed 
before activation. Generally, such precursors are called proproteins. 

A polynucleotide of the invention thus may encode a mature protein, a mature protein 
plus a leader sequence (which may be referred to as a preprotein), a precursor of a mature protein 
having one or more prosequences that are not the leader sequences of a preprotein, or a 
preproprotein, which is a precursor to a proprotein, having a leader sequence and one or more 
prosequences, which generally are removed during processing steps that produce active and 
mature forms of the polypeptide. 

In addition to the standard A, G, C, T/U representations for nucleotides, the term "N" 
may also be used in describing certain polynucleotides of the invention. "N" means that any of 
the four DNA or RNA nucleotides may appear at such a designated position in the DNA or RNA 
sequence, except it is preferred that N is not a nucleotide that when taken in combination with 
adjacent nucleotide positions, read in the correct reading frame, would have the effect of 
generating a premature termination codon in such reading frame. 
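
The preference stated above for "N" positions can be illustrated with a minimal sketch. For each in-frame codon that contains one or more "N" positions, it enumerates every expansion of the codon and keeps those that are not termination codons. It assumes the standard termination codons TAA, TAG and TGA, treats U as T, and reads the sequence from a caller-supplied frame offset; the function names and the toy sequence are hypothetical.

```python
from itertools import product

# Standard termination codons, written in DNA form.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def non_stop_expansions(codon: str):
    """All expansions of a codon's N positions that are not stop codons."""
    codon = codon.upper().replace("U", "T")
    # Each N may be any of the four nucleotides; other bases are fixed.
    choices = ["ACGT" if base == "N" else base for base in codon]
    expansions = ("".join(combo) for combo in product(*choices))
    return [c for c in expansions if c not in STOP_CODONS]


def preferred_n_codons(seq: str, frame: int = 0):
    """Map the 0-based start of each N-containing codon to its allowed expansions."""
    seq = seq.upper().replace("U", "T")
    allowed = {}
    for start in range(frame, len(seq) - 2, 3):
        codon = seq[start:start + 3]
        if "N" in codon:
            allowed[start] = non_stop_expansions(codon)
    return allowed


# Toy coding sequence (hypothetical): the second codon is TAN.
result = preferred_n_codons("ATGTANGGC")
```

In the toy sequence, N = A or N = G would yield TAA or TAG in frame, so only TAC and TAT remain as preferred readings of the TAN codon.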

For each and every polynucleotide of the invention there is also provided a 
polynucleotide complementary to it. 

Vectors, Host Cells, and Expression Systems 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of 
the invention, host cells that are genetically engineered with vectors of the invention and the 
production of polypeptides of the invention by recombinant techniques. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived from the DNA 
constructs of the invention. 

Recombinant Dnal polypeptides of the present invention may be prepared by processes 
well known to those skilled in the art from genetically engineered host cells comprising 
expression systems. Accordingly, in a further aspect, the present invention relates to expression 
systems that comprise a dnal polynucleotide or polynucleotides of the present invention, to host 
cells which are genetically engineered with such expression systems, and to the production of 
polypeptides of the invention by recombinant techniques. 

For recombinant production of Dnal polypeptides of the invention, host cells can be 
genetically engineered to incorporate expression systems or portions thereof or polynucleotides 
of the invention. Representative examples of appropriate hosts include bacterial cells (Gram 
positive and Gram negative), fungal cells, insect cells, animal cells and plant cells. 
Polynucleotides are introduced to bacteria by standard chemical treatment protocols, such as the 
induction of competence to take up DNA by treatment with calcium chloride (Sambrook et al., 
supra). Introduction of polynucleotides into fungal (e.g., yeast) host cells is effected, if desired, 
by standard chemical methods, such as lithium acetate-mediated transformation. 

A great variety of expression systems are useful to produce Dnal polypeptides of the 
invention. Such vectors include among others, chromosomal-, episomal- and virus-derived 
vectors. For example, vectors derived from bacterial plasmids, from bacteriophages, from 
transposons, from yeast episomes, from insertion elements, from yeast chromosomal elements, 
from viruses, and from vectors derived from combinations thereof, are useful in the invention. 

Dnal polypeptides of the invention are recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid or 
urea extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography, and lectin chromatography. Well known techniques for refolding may be 
employed to regenerate an active conformation when the Dnal polypeptide is denatured during 
isolation and/or purification. 

Diagnostic, Prognostic, Serotyping, and Mutation Assays 

This invention is also related to the use of dnal polynucleotides and polypeptides of the 
invention as diagnostic reagents. Detection of S. aureus dnal polynucleotides and/or 
polypeptides in a eukaryote, particularly a mammal, and especially a human, will provide a 
diagnostic method for diagnosis of disease, staging of disease or response of an infectious 
organism to drugs. Eukaryotes, particularly mammals, and especially humans, particularly those 
infected or suspected to be infected with an organism comprising the S. aureus dnal gene or 
protein, may be detected at the nucleic acid or amino acid level by a variety of well known 
techniques as well as by methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be 
obtained from a putatively infected and/or infected individual's bodily materials. Polynucleotides 
from any of these sources, particularly DNA or RNA, may be used directly for detection or may 
be amplified enzymatically by using PCR or any other amplification technique prior to analysis. 
RNA, particularly mRNA, cDNA and genomic DNA may also be used in the same ways. Using 
amplification, characterization of the species and strain of infectious or resident organism present 
in an individual may be made by an analysis of the genotype of a selected polynucleotide of the 
organism. Deletions and insertions can be detected by a change in size of the amplified product 
in comparison to a genotype of a reference sequence selected from a related organism, preferably 
a different species of the same genus or a different strain of the same species. Point mutations 
can be identified by hybridizing amplified DNA to labeled dnal polynucleotide sequences. 
Perfectly or significantly matched sequences can be distinguished from imperfectly or more 
significantly mismatched duplexes by DNase or RNase digestion, for DNA or RNA respectively, 
or by detecting differences in melting temperatures or renaturation kinetics. Polynucleotide 
sequence differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequence. This may be carried out 
with or without denaturing agents. Polynucleotide differences may also be detected by direct 
DNA or RNA sequencing. See, for example, Myers et al., (1985) Science 230, 1242. Sequence 
changes at specific locations also may be revealed by nuclease protection assays, such as RNase, 
V1 and S1 protection assay or a chemical cleavage method. See, for example, Cotton et al., 
(1988) Proc. Natl. Acad. Sci. USA 85, 4397-4401. 
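
The comparison of an amplified product with a reference sequence can be sketched in silico as follows. This is a simplified illustration, not one of the laboratory methods named above: it assumes both amplicon sequences are already known (for example from direct sequencing), reports an insertion or deletion from the change in product size alone, and lists mismatched positions only when the two products are the same length. The function name, the returned field names and the toy sequences are hypothetical.

```python
def compare_amplicon(reference: str, sample: str) -> dict:
    """Classify a sample amplicon relative to a reference amplicon.

    A size difference is reported as an insertion or deletion; equal-length
    products are scanned for point differences (0-based positions).
    """
    reference, sample = reference.upper(), sample.upper()
    size_change = len(sample) - len(reference)
    if size_change != 0:
        kind = "insertion" if size_change > 0 else "deletion"
        return {"kind": kind, "size_change": size_change, "mismatches": []}
    mismatches = [
        (pos, ref_base, sample_base)
        for pos, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]
    kind = "point mutation" if mismatches else "match"
    return {"kind": kind, "size_change": 0, "mismatches": mismatches}


# Toy amplicons (hypothetical, not derived from SEQ ID NO: 1).
ref_amplicon = "ATGGCTAAACGT"
point = compare_amplicon(ref_amplicon, "ATGGCTAGACGT")
deletion = compare_amplicon(ref_amplicon, "ATGGCTACGT")
```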

In another embodiment, an array of oligonucleotide probes comprising dnal nucleotide 
sequence or fragments thereof can be constructed to conduct efficient screening of, for example, 
genetic mutations, serotype, taxonomic classification or identification. Array technology 
methods are well known and have general applicability and can be used to address a variety of 
questions in molecular genetics including gene expression, genetic linkage, and genetic 
variability (see, for example, Chee et al., (1996) Science 274, 610). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 
(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ ID NO: 
1, or a fragment thereof; (b) a nucleotide sequence complementary to that of (a); (c) a 
polypeptide of the present invention, preferably the polypeptide of SEQ ID NO: 2 or a fragment 
thereof; or (d) an antibody to a polypeptide of the present invention, preferably to the 
polypeptide of SEQ ID NO: 2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. Such a kit will be of use in diagnosing a disease or susceptibility to a disease, among 
others. 

This invention also relates to the use of dnal polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of a polynucleotide of the invention, preferably 
SEQ ID NO: 1, which is associated with a disease or pathogenicity will provide a diagnostic tool 
that can add to, or define, a diagnosis of a disease, a prognosis of a course of disease, a 
determination of a stage of disease, or a susceptibility to a disease, which results from 
under-expression, over-expression or altered expression of the polynucleotide. Organisms, particularly 
infectious organisms, carrying mutations in such polynucleotide may be detected at the 
polynucleotide level by a variety of techniques, such as those described elsewhere herein. 

The dnal nucleotide sequences of the present invention are also valuable for organism 
chromosome identification. The sequence is specifically targeted to, and can hybridize with, a 
particular location on an organism's chromosome, particularly to a S. aureus chromosome. The 
mapping of relevant sequences to chromosomes according to the present invention may be an 
important step in correlating those sequences with pathogenic potential and/or an ecological 
niche of an organism and/or drug resistance of an organism, as well as the essentiality of the 
gene to the organism. Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map data. 
Such data may be found on-line in a sequence database. The relationships between genes and 
diseases that have been mapped to the same chromosomal region are then identified through 
known genetic methods, for example, through linkage analysis (coinheritance of physically 
adjacent genes) or mating studies, such as by conjugation. 

The differences in a polynucleotide and/or polypeptide sequence between organisms 
possessing a first phenotype and organisms possessing a different, second phenotype 
can also be determined. If a mutation is observed in some or all organisms possessing the first 
phenotype but not in any organisms possessing the second phenotype, then the mutation is likely 
to be the causative agent of the first phenotype. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be 
obtained from a putatively infected and/or infected individual's bodily materials. Polynucleotides, 
particularly DNA, from any of these sources may be used directly for detection or may be 
amplified enzymatically using PCR or another amplification technique with oligonucleotide 
amplification primers derived from the polynucleotide sequence of S. aureus dnal. RNA, 
particularly mRNA, or RNA reverse transcribed to cDNA, is also useful for diagnostics. 
Following amplification of S. aureus dnal-related polynucleotide from a sample, 
characterization of the species and strain of infecting or resident organism is made by an analysis 
of the amplified polynucleotide relative to one or more reference polynucleotides or sequences 
relative to a standard from a related organism (i.e., a known strain of S. aureus). 

Point mutations can be identified by hybridizing amplified DNA to known dnal 
polynucleotide sequences and by detecting differences in melting temperatures or renaturation 
kinetics. Perfectly or significantly matched sequences can be distinguished from imperfectly or 
more significantly mismatched duplexes by RNase protection or S1 nuclease mapping. (See, for 
example, Cotton et al., (1988) Proc. Natl. Acad. Sci. USA 85: 4397-4401). Polynucleotide 
sequence differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequence. This may be carried out 
with or without denaturing agents. Polynucleotide differences may also be detected by direct 
DNA or RNA sequencing. See, for example, Myers et al., (1985) Science 230, 1242. Sequence 
changes at specific locations also may be revealed by nuclease protection assays, such as RNase, 
V1 and S1 protection assay or a chemical cleavage method (Cotton et al., 1988, supra). 

In another embodiment, an array of oligonucleotide probes comprising dnal nucleotide 
sequence or fragments thereof can be constructed to conduct efficient screening of, for example, 
genetic mutations, serotype, taxonomic classification or identification. Array technology 
methods are well known and have general applicability and can be used to address a variety of 
questions in molecular genetics including gene expression, genetic linkage, and genetic 
variability (see, for example, Chee et al., (1996) Science 274, 610). 

In another aspect, the present invention relates to a diagnostic kit which comprises: (a) a 
polynucleotide of the present invention, preferably the nucleotide sequence of SEQ ID NO: 1, or 
a fragment thereof; (b) a nucleotide sequence complementary to that of (a); (c) a polypeptide of 
the present invention, preferably the polypeptide of SEQ ID NO: 2 or a fragment thereof; or (d) 
an antibody to a polypeptide of the present invention, preferably to the polypeptide of SEQ ID 
NO: 2. Such a kit will be of use in diagnosing a disease or susceptibility to a disease, among 
other uses. 

The invention further provides a process for diagnosing bacterial infections such as those 
caused by S. aureus, the process comprising determining from a sample derived from an 
individual, such as a bodily material, an increased level of expression of a polynucleotide having 
a sequence disclosed in SEQ ID NO: 1 relative to a sample taken from a non-diseased individual. 
Increased or decreased expression of a dnal polynucleotide can be measured using any one of the 
methods well known in the art for the quantitation of polynucleotides, such as, for example, 
PCR, RT-PCR, RNase protection, Northern blotting and other hybridization methods, and 
spectrometry. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of Dnal polypeptide compared to normal control tissue samples may be used to detect 
the presence of an infection, for example. Assay techniques that can be used to determine levels 
of a S. aureus Dnal polypeptide, in a sample derived from a host, such as a bodily material, are 
well-known to those of skill in the art. Such assay methods include radioimmunoassays, 
competitive-binding assays, Western Blot analysis, antibody sandwich assays, antibody detection 
and ELISA assays. 

Gridding and Polynucleotide Subtraction of S. aureus Genomic Sequences 

The dnal polynucleotides of the invention may be used as components of polynucleotide 
arrays, preferably high density arrays or grids. These high density arrays are particularly useful 
for diagnostic and prognostic purposes. For example, a set of spots each comprising a different 
gene, and further comprising a polynucleotide or polynucleotides of the invention, may be used 
for probing, such as hybridization or nucleic acid amplification, using a probe obtained or 
derived from a bodily sample, to determine the presence of a particular polynucleotide sequence or 
related sequence in an individual. 

Antibodies Specific for S. aureus Peptides or Polypeptides 

The Dnal polypeptides and polynucleotides of the invention or variants thereof, or cells 
expressing them are useful as immunogens to produce antibodies immunospecific for such 
polypeptides or polynucleotides, respectively. 

In certain preferred embodiments of the invention there are provided antibodies against S. 
aureus Dnal polypeptides or polynucleotides encoding them. Antibodies against Dnal- 
polypeptide or dnal-polynucleotide are useful for treatment of infections, particularly bacterial 
infections. 

Antibodies generated against the polypeptides or polynucleotides of the invention are 
obtained by administering the polypeptides and/or polynucleotides of the invention or epitope- 
bearing fragments of either or both, analogues of either or both, or cells expressing either or 
both, to an animal, preferably a nonhuman, using routine protocols. For preparation of 
monoclonal antibodies, any technique known in the art that provides antibodies produced by 
continuous cell line cultures is useful. Examples include various techniques, such as those in 
Kohler, G. and Milstein, C., Nature 256: 495-497 (1975); Kozbor et al., Immunology Today 4: 
72 (1983); and Cole et al., pp. 77-96 in MONOCLONAL ANTIBODIES AND CANCER 
THERAPY, Alan R. Liss, Inc. (1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can 
be adapted to produce single chain antibodies to polypeptides or polynucleotides of this 
invention. Also, transgenic mice, or other mammals, are useful to express humanized antibodies 
immunospecific to the polypeptides or polynucleotides of the invention. 

When antibodies are administered therapeutically, the antibody or variant thereof is 
preferably modified to make it less immunogenic in the individual. For example, if the individual 
is human the antibody is most preferably "humanized," where the complementarity determining 
region or regions of the hybridoma-derived antibody have been transplanted into a human 
monoclonal antibody, for example as described in Jones et al. (1986), Nature 321, 522-525 or 
Tempest et al., (1991) Biotechnology 9, 266-273. 

Alternatively, phage display technology is useful to select antibody genes with binding 
activities towards a Dnal polypeptide of the invention. In one approach, antibody fragments 
specific for S. aureus Dnal are selected from an immune library of antibody genes expressed as 
fusions with coat protein of filamentous phage. Alternatively, naive libraries are screened by 
phage display techniques to identify genes encoding antibodies specific for Dnal 
(McCafferty, et al., (1990), Nature 348, 552-554; Marks, et al., (1992) Biotechnology 
10, 779-783; a recent reference is de Haard et al. (1999) J. Biol. Chem. 274: 18218-18230). The 
ability to recover, for various targets, antibodies with subnanomolar affinities obviates the need 
for immunization. The affinity of these antibodies can also be improved by, for example, chain 
shuffling (Clackson et al, (1991) Nature 352: 628). 

The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptides or polynucleotides of the invention, for example to purify the 
polypeptides or polynucleotides by immunoaffinity chromatography. 

A variant polypeptide or polynucleotide of the invention, such as an antigenically or 
immunologically equivalent derivative or a fusion protein of the polypeptide is also useful as an 
antigen to immunize a mouse or other animal such as a rat or chicken. A fused protein provides 
stability to the polypeptide acting as a carrier, or acts as an adjuvant or both. Alternatively, the 
antigen is associated, for example by conjugation, with an immunogenic carrier protein, such as 
bovine serum albumin, keyhole limpet haemocyanin or tetanus toxoid. Alternatively, when 
antibodies are to be administered therapeutically, a multiple antigenic polypeptide 
comprising multiple copies of the polypeptide, or an antigenically or immunologically equivalent 
polypeptide thereof may be sufficiently antigenic to improve immunogenicity so as to obviate 
the use of a carrier. 

In accordance with an aspect of the invention, there is provided the use of a dnal 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular genetic 
immunization. The use of a dnal polynucleotide of the invention in genetic immunization 
preferably employs a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et al., Hum. Mol. Genet. (1992) 1: 363; Manthorpe et al., Hum. Gene Ther. (1983) 
4: 419), delivery of DNA complexed with specific protein carriers (Wu et al., J. Biol. Chem. 
(1989) 264: 16985), coprecipitation of DNA with calcium phosphate (Benvenisty & Reshef, 
PNAS USA, (1986) 83: 9551), encapsulation of DNA in various forms of liposomes (Kaneda et 
al., Science (1989) 243: 375), particle bombardment (Tang et al., Nature (1992) 356: 152; 
Eisenbraun et al., DNA Cell Biol. (1993) 12: 791) or in vivo infection using cloned retroviral 
vectors (Seeger et al., PNAS USA (1984) 81: 5849). 

Antagonists and Agonists: Assays and Molecules 

The invention is based in part on the discovery that Dnal is a target for the bacteriophage 
77ORF104 inhibitory factor. Applicants have recognized the utility of the interaction in the 
development of antibacterial agents. Specifically, the inventors have recognized that 1) Dnal is a 
critical target for bacterial inhibition; 2) 77ORF104 or derivatives or functional mimetics thereof 
are useful for inhibiting bacterial growth; and 3) the interaction between Dnal of S. aureus 
and 77ORF104 may be used as a target for the screening and rational design of drugs or 
antibacterial agents. In addition to methods of directly inhibiting Dnal activity, methods of 
inhibiting Dnal expression are also attractive for antibacterial activity. 

In several embodiments of the invention, there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity or 
expression of a polypeptide and/or polynucleotide of the invention comprising: contacting a 
polypeptide and/or polynucleotide of the invention with a compound to be screened under 
conditions to permit binding to or other interaction between the compound and the polypeptide 
and/or polynucleotide to assess the binding to or other interaction with the compound, such 
binding or interaction preferably being associated with a second component capable of providing 
a detectable signal in response to the binding or interaction of the polypeptide and/or 
polynucleotide with the compound; and determining whether the compound binds to or 
otherwise interacts with and activates or inhibits an activity or expression of the polypeptide 
and/or polynucleotide by detecting the presence or absence of a signal generated from the 
binding or interaction of the compound with the polypeptide and/or polynucleotide. 

Potential antagonists include, among others, small organic molecules, peptides, 
polypeptides and antibodies that bind to a polynucleotide and/or polypeptide of the invention and 
thereby inhibit or extinguish its activity or expression. Potential antagonists also may be small 
organic molecules, a peptide, a polypeptide such as a closely related protein or antibody that 
binds the same sites on a binding molecule without inducing dnal-induced 
activities, thereby preventing the action or expression of S. aureus Dnal polypeptides 
and/or polynucleotides by excluding S. aureus Dnal polypeptides and/or polynucleotides from 

Potential antagonists also include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, (1991) J. Neurochem. 56, 560; see also 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, 
CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of 77ORF104 and of Dnal. Other 
examples of potential polypeptide antagonists include antibodies or, in some cases, 
oligonucleotides or proteins which are closely related to the ligands, substrates, receptors, 
enzymes, etc., as the case may be, of the polypeptide, e.g., a fragment of the ligands, substrates, 
receptors, enzymes, etc.; or small molecules which bind to the polypeptide of the present 
invention but do not elicit a response, so that the activity of the polypeptide is prevented. 

Compounds may be identified from a variety of sources, for example, cells, cell-free 
preparations, chemical libraries, and natural product mixtures. These substrates and ligands may 
be natural substrates and ligands or may be structural or functional mimetics. See, e.g., Coligan 
et al., Current Protocols in Immunology 1(2): Chapter 5 (1991). Peptide modulators can also be 
selected by screening large random libraries of all possible peptides of a certain length. 

Compounds could also be derived from the polypeptide sequence of 77ORF104 itself. 
Peptide fragments representing small overlapping fragments or peptides spanning the entire 
amino acid sequence of the protein can be used to perform extensive screens. Fragments of 
77ORF104 can be produced by proteolytic digestion of the full-length protein as described 
above. Alternatively, suitable 77ORF104-derived peptide or polypeptide fragments 
representative of the complete sequence of the protein can be chemically synthesized. For 
instance, in the multi-pin approach, peptides are simultaneously synthesized by the assembly of 
small quantities of peptides on plastic pins derivatized with an ester linker based on glycolate 
and 4-(hydroxymethyl) benzoate (Maeji et al. (1991) Pept. Res. 4: 142-6). 
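
The idea of overlapping peptides spanning the entire amino acid sequence can be illustrated with a minimal sketch. The peptide length of 15 residues and the offset of 5 residues are assumptions chosen for illustration; the text does not specify them. The sketch also assumes the offset does not exceed the peptide length, so that no residue is skipped, and it adds a final peptide ending at the carboxy terminus when the tiling would otherwise stop short. The function name and the toy sequence are hypothetical and do not represent 77ORF104.

```python
def overlapping_peptides(protein: str, length: int = 15, offset: int = 5):
    """Return overlapping peptides that together span the whole protein.

    Windows of `length` residues start every `offset` residues; a last
    window is added if needed so the C-terminal residues are covered.
    """
    if length <= 0 or offset <= 0:
        raise ValueError("length and offset must be positive")
    if len(protein) <= length:
        return [protein]
    last_start = len(protein) - length
    starts = list(range(0, last_start + 1, offset))
    if starts[-1] != last_start:
        # Ensure the final peptide ends exactly at the C-terminus.
        starts.append(last_start)
    return [protein[s:s + length] for s in starts]


# Toy 32-residue sequence (hypothetical).
toy_protein = "MSTNPKRQLAEGHIDVFWYCMSTNPKRQLAEG"
peptides = overlapping_peptides(toy_protein)
```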

Certain of the polypeptides of the invention are biomimetics, functional mimetics of the 
natural S. aureus Dnal polypeptide. These functional mimetics are useful for, among other 
things, antagonizing the activity of S. aureus Dnal polypeptide or as an antigen or immunogen in 
a manner described above. Functional mimetics of the polypeptides of the invention include but 
are not limited to truncated polypeptides. For example, preferred functional mimetics include a 
polypeptide comprising the polypeptide sequence set forth in SEQ ID NO: 2 lacking 20, 30, 40, 
50, 60, 70 or 80 amino- or carboxy-terminal amino acid residues, including fusion proteins 
comprising one or more of these truncated sequences. Polynucleotides encoding each of these 
functional mimetics may be used as expression cassettes to express each mimetic polypeptide. It 
is preferred that these cassettes comprise 5' and 3' restriction sites to allow for a convenient 
means to ligate the cassettes together when desired. It is further preferred that these cassettes 
comprise gene expression signals known in the art or described elsewhere herein. 

Screening Assays According to the Invention 

It is desirable to devise screening methods to identify compounds which stimulate or 
which inhibit the function of the DnaI polypeptide or polynucleotide of the invention. 
Accordingly, the present invention provides for a method of screening compounds to identify 
those that modulate the function of a polypeptide or polynucleotide of the invention. In general, 
antagonists may be employed for therapeutic and prophylactic purposes. It is contemplated that 
an agonist of DnaI may be useful, for example, to enhance the growth rate of bacteria in a sample 
being cultured for diagnostic or other purposes. 

Screening methods generally fall into two broad categories: those that assay binding of 
candidate compounds; and those that assay a functional aspect of the target. 

a) Binding Assays 

There are a number of methods of examining binding of a candidate compound to a 
protein target such as DnaI. Screening methods that measure the binding of a candidate 
compound to the DnaI polypeptide or polynucleotide, or to cells or supports bearing the 
polypeptide or a fusion protein comprising the polypeptide, by means of a label directly or 
indirectly associated with the candidate compound, are useful in the invention. 

The screening method may involve competition for binding of a labeled competitor such 
as 77ORF104 or a fragment that is competent to bind DnaI. 

i) Phage display 

Phage display is a powerful assay to measure protein:protein interaction. In this scheme, 
proteins or peptides are expressed as fusions with coat proteins or tail proteins of filamentous 
bacteriophage. A comprehensive monograph on this subject is Phage Display of Peptides and 
Proteins: A Laboratory Manual, edited by Kay et al. (1996) Academic Press. For phages in the Ff 
family that include M13 and fd, gene III protein and gene VIII protein are the most commonly 
used partners for fusion with foreign protein or peptides. Phagemids are vectors containing 
origins of replication both for plasmids and for bacteriophage. Phagemids encoding fusions to 
gene III or gene VIII can be rescued from their bacterial hosts with helper phage, resulting in 
the display of the foreign sequences on the coat or at the tip of the recombinant phage. 

In the simplest assay, purified recombinant DnaI protein, or a fragment of DnaI, for 
example comprising the amino acid sequence of SEQ ID NO: 16, could be immobilized in the 
wells of a microtitre plate and incubated with phages displaying 77ORF104 in fusion with the 
gene III protein. Washing steps are performed to remove unbound phages and bound phages are 
detected with monoclonal antibodies directed against phage coat protein (gene VIII protein). 
Color development by means of an enzyme-linked secondary antibody allows quantitative 
detection of bound fusion protein. Screening for inhibitors is performed by the incubation of the 
compound with the immobilized target before the addition of phages. The presence of an 
inhibitor will specifically reduce the signal in a dose-dependent manner relative to controls 
without inhibitor. 
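
As a non-limiting sketch of how such a dose-dependent reduction in signal might be quantified, the following example computes percent inhibition relative to a control without inhibitor and estimates, by log-linear interpolation, the concentration giving 50% inhibition. The function names, the interpolation method and the idea of a background (no-target) well are assumptions made for illustration and are not part of the assay as described.

```python
import math


def percent_inhibition(signal, no_inhibitor, background=0.0):
    """Percent reduction of the background-corrected signal relative to
    the control well that received no inhibitor."""
    control = no_inhibitor - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / control)


def estimate_ic50(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition by interpolating
    linearly in log10(concentration). Concentrations must be positive.
    Returns None if the data never cross 50%."""
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None
```

In practice a fitted dose-response curve would normally replace the simple interpolation used here.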



ii) Surface plasmon resonance 

Another powerful assay to screen for inhibitors of a protein:protein interaction is 
surface plasmon resonance. Surface plasmon resonance is a quantitative method that measures 
binding between two (or more) molecules by the change in mass near the sensor surface caused 
by the binding of one protein or other biomolecule from the aqueous phase to a second protein or 
biomolecule immobilized on the sensor. This change in mass is measured as resonance units 
versus time after injection or removal of the second protein or biomolecule and is measured 
using a Biacore Biosensor (Biacore AB). DnaI could be immobilized on a sensor chip (for 
example, research grade CM5 chip; Biacore AB) using a covalent linkage method (e.g., amine 
coupling in 10 mM sodium acetate [pH 4.5]). A blank surface is prepared by activating and 
inactivating a sensor chip without protein immobilization. The binding of 77ORF104 to DnaI, or 
a fragment of DnaI, for example comprising the amino acid sequence of SEQ ID NO: 16, is 
measured by injecting purified 77ORF104 over the chip surface. Measurements are performed at 
room temperature. Conditions used for the assay (i.e., those permitting binding) are as follows: 
25 mM HEPES-KOH (pH 7.6), 150 mM sodium chloride, 15% glycerol, 1 mM dithiothreitol, 
and 0.001% Tween 20 with a flow rate of 10 µl/min. Preincubation of the sensor chip with 
candidate inhibitors will predictably decrease the interaction between 77ORF104 and DnaI. A 
decrease in 77ORF104 binding is indicative of competitive binding by the candidate compound. 

iii) Fluorescence Resonance Energy Transfer (FRET) 

Another method of measuring inhibition of binding of two proteins uses fluorescence resonance 
energy transfer (FRET; de Angelis, 1999, Physiological Genomics). FRET is a quantum 
mechanical phenomenon that occurs between a fluorescence donor (D) and a fluorescence 
acceptor (A) in close proximity (usually < 100 Å of separation) if the emission spectrum of D 
overlaps with the excitation spectrum of A. Variants of the green fluorescent protein (GFP) from 
the jellyfish Aequorea victoria are fused to a polypeptide or protein and serve as D-A pairs in a 
FRET scheme to measure protein-protein interaction. Cyan (CFP: D) and yellow (YFP: A) 
fluorescence proteins are linked with DnaI polypeptide, or a fragment of DnaI, for example 
comprising the amino acid sequence of SEQ ID NO: 16, and 77ORF104 protein respectively. 
Under optimal proximity, interaction between DnaI, or a fragment of DnaI, for example 
comprising the amino acid sequence of SEQ ID NO: 16, and 77ORF104 causes a decrease in 
intensity of CFP concomitant with an increase in YFP fluorescence. 

The addition of a candidate modulator to the mixture of appropriately labeled DnaI and 
77ORF104 protein will result in an inhibition of energy transfer evidenced by, for example, a 
decrease in YFP fluorescence at a given concentration of 77ORF104 relative to a sample without 
the candidate inhibitor. 
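
The distance dependence that underlies the FRET read-out can be illustrated with the standard Förster relation, E = 1 / (1 + (r/R0)^6). The sketch below is offered for illustration only; the Förster radius supplied by the caller (on the order of 50 Å is often quoted for CFP/YFP pairs) and the simple acceptor-to-donor intensity ratio are assumptions, not parameters given in this specification.

```python
def fret_efficiency(distance, forster_radius):
    """Forster relation E = 1 / (1 + (r / R0)**6).

    `distance` and `forster_radius` must be in the same length unit.
    """
    if distance < 0 or forster_radius <= 0:
        raise ValueError("distance must be >= 0 and forster_radius > 0")
    return 1.0 / (1.0 + (distance / forster_radius) ** 6)


def acceptor_donor_ratio(yfp_intensity, cfp_intensity):
    """Ratio of acceptor (YFP) to donor (CFP) emission; the ratio falls
    when a compound disrupts the donor-acceptor complex."""
    if cfp_intensity <= 0:
        raise ValueError("donor intensity must be positive")
    return yfp_intensity / cfp_intensity


# Assumed Forster radius of 50 angstroms, for illustration only.
for r in (25, 50, 75, 100):
    print(r, round(fret_efficiency(r, 50.0), 3))
```

Because efficiency falls with the sixth power of distance, transfer is already small at twice the Förster radius.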

iv) Fluorescence polarization 

In addition to the surface plasmon resonance and FRET methods, fluorescence polarization 
measurement is useful to quantitate protein-protein binding. The fluorescence polarization value 
for a fluorescently-tagged molecule depends on the rotational correlation time or tumbling rate. 
Protein complexes, such as those formed by S. aureus DnaI polypeptide, or a fragment of DnaI, 
for example comprising the amino acid sequence of SEQ ID NO: 16, associating with a 
fluorescently labeled polypeptide (e.g., 77ORF104 or a binding fragment thereof), have higher 
polarization values than a fluorescently labeled monomeric protein. Inclusion of a candidate 
inhibitor of the DnaI interaction results in a decrease in fluorescence polarization relative to a 
mixture without the candidate inhibitor if the candidate inhibitor disrupts or inhibits the 
interaction of DnaI with its polypeptide binding partner. It is preferred that this method be used 
to characterize small molecules that disrupt the formation of polypeptide or protein complexes. 
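
For illustration, a polarization value may be computed from the parallel and perpendicular emission intensities using the conventional definition P = (I_par - G x I_perp) / (I_par + G x I_perp), commonly reported in millipolarization (mP) units. The instrument correction factor G and the function name below are assumptions for the purpose of the example.

```python
def polarization_mP(i_parallel, i_perpendicular, g_factor=1.0):
    """Fluorescence polarization in millipolarization units (mP).

    `g_factor` is an instrument-specific correction applied to the
    perpendicular channel; 1.0 is assumed here for simplicity.
    """
    perp = g_factor * i_perpendicular
    total = i_parallel + perp
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return 1000.0 * (i_parallel - perp) / total


# A slowly tumbling complex retains more polarization than free tracer.
print(polarization_mP(300.0, 200.0))
print(polarization_mP(260.0, 240.0))
```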

v) Scintillation Proximity Assay 

A scintillation proximity assay may be used to characterize the interaction between a S. 
aureus DnaI polypeptide, or a fragment of DnaI polypeptide, for example comprising the amino 
acid sequence of SEQ ID NO: 16, and another polypeptide. For the assay, S. aureus DnaI 
polypeptide can be covalently coupled to beads. Addition of radio-labeled 77ORF104 results in 
binding where the radioactive source molecule is in close proximity to the scintillation fluid. 
Thus, signal is emitted upon 77ORF104 polypeptide binding, and compounds that prevent 
association between S. aureus DnaI polypeptide and 77ORF104 diminish the scintillation signal. 

vi) Bio Sensor Assay 

ICS biosensors have been described by AMBRI (Australian Membrane Biotechnology 
Research Institute; http://www.ambri.com.au/). In this technology, the self-association of 
macromolecules such as DnaI, or a fragment of DnaI, for example comprising the amino acid 
sequence of SEQ ID NO: 16, and bacteriophage 77 ORF 104, is coupled to the closing of 
gramicidin-facilitated ion channels in suspended membrane bilayers and hence to a measurable 
change in the admittance (similar to impedance) of the biosensor. This approach is linear over six 
orders of magnitude of admittance change and is ideally suited for large scale, high-throughput 
screening of small molecule combinatorial libraries. 

It is important to note that in assays of protein-protein interaction, it is possible that a 
modulator of the interaction need not necessarily interact directly with the domain(s) of the 
proteins that physically interact. It is also possible that a modulator will interact at a location 
removed from the site of protein-protein interaction and cause, for example, a conformational 
change in the DnaI polypeptide. Modulators (inhibitors or agonists) that act in this manner are of 
interest since the change they induce may modify the activity of the DnaI polypeptide. 

b) Assays of DnaI Functional Activity 

i) Assay for DNA replication: ³H-thymidine incorporation 

To measure the effect of 77ORF104 expression on S. aureus DNA replication, the level 
of radiolabeled thymidine incorporation into DNA is measured in the presence or in the absence 
of sodium arsenite (5 µM). Samples (0.5 ml) are withdrawn from the cultures at appropriate time 
intervals and mixed with 4.5 µl of labeling solution (0.2 µCi/ml of ³H-thymidine (73 Ci/mmol, 
NEN Life Science Products, Inc.) and 70 pmol of cold thymidine). After 15 min of reaction, 
incorporation is stopped by adding solution containing 0.2% NaN3 and 1 mM cold thymidine. 
Samples are precipitated with 10% w/v trichloroacetic acid and filtered through glass fiber filters 
(GF-C, Whatman). The results are expressed as ³H-thymidine counts incorporated normalized to 
the OD of the culture. 

The assay is performed in the presence of varying concentrations of candidate inhibitors 
in place of 77ORF104 to screen for inhibitors. At least a 10-fold reduction in ³H-thymidine 
incorporation in the presence of 77ORF104 or other inhibitor indicates a reduction in DnaI 
activity. 
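
The normalization and the 10-fold criterion described above may be sketched as follows. The function names and the example counts are hypothetical and serve only to illustrate the arithmetic.

```python
def normalized_incorporation(counts, optical_density):
    """3H-thymidine counts divided by the culture OD."""
    if optical_density <= 0:
        raise ValueError("optical density must be positive")
    return counts / optical_density


def fold_reduction(control_counts, control_od, treated_counts, treated_od):
    """Ratio of OD-normalized incorporation: control over treated."""
    treated = normalized_incorporation(treated_counts, treated_od)
    if treated <= 0:
        return float("inf")
    return normalized_incorporation(control_counts, control_od) / treated


def indicates_reduced_activity(fold, threshold=10.0):
    """Apply the 'at least 10-fold reduction' criterion from the text."""
    return fold >= threshold


# Hypothetical counts for illustration only.
fold = fold_reduction(12000, 0.6, 800, 0.5)
print(fold, indicates_reduced_activity(fold))
```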

ii) Plasmid replication 

The plasmid pC194 replicates in S. aureus by a rolling circle mechanism. The single 
stranded origin, sso, of pC194 is involved in the synthesis of the lagging DNA strand. The 
plasmid pADG6406 is a derivative of pC194 lacking sso. The absence of sso leads to the 
accumulation of plasmid single-stranded DNA. The single-stranded (ss) initiation site, ssiA, is 
located on the lagging strand of pAMβ1 and is a site for primosome assembly. ssiA was inserted 
into plasmid pADG6404. S. aureus harboring plasmids are grown to mid-log phase and their 
total DNA is extracted and analyzed by Southern hybridization, using ³²P-labeled plasmid DNA 
as probe. The presence of pADG6406 with ssiA is associated with a decrease in the ratio of ss to 
double stranded (ds) DNA compared to that of the plasmid without ssiA. This system is used to 
measure the effect of 77ORF104 or a candidate inhibitor polypeptide expression on ss DNA 
synthesis. In an assay, a plasmid containing 77ORF104 or a candidate inhibitor polypeptide 
coding sequence under an arsenite inducible promoter is introduced into an S. aureus strain 
harboring pADG6406. The ratio of ss to ds DNA of pADG6406 is measured in the presence or 
in the absence of sodium arsenite (5 µM). An increase in the ratio of ss to ds DNA (10% or 
more) indicates an effect of the candidate modulator. 

In another aspect, the present invention relates to a screening kit for identifying agonists, 
antagonists, ligands, receptors, substrates, enzymes, etc. for a polypeptide and/or 
polynucleotide of the present invention; or compounds 
which decrease or enhance the production of such polypeptides and/or polynucleotides, which 
comprises: (a) a polypeptide and/or a polynucleotide of the present invention; (b) a recombinant 
cell expressing a polypeptide and/or polynucleotide of the present invention; (c) a cell membrane 
expressing a polypeptide and/or polynucleotide of the present invention; or (d) antibody to a 
polypeptide and/or polynucleotide of the present invention; which polypeptide is preferably that 
of SEQ ID NO: 2, and which polynucleotide is preferably that of SEQ ID NO: 1. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. 

It will be readily appreciated by the skilled artisan that a polypeptide and/or 
polynucleotide of the present invention may also be used in a method for the structure-based 
design of an agonist, antagonist or inhibitor of the polypeptide and/or polynucleotide, by: (a) 
determining in the first instance the three-dimensional structure of the polypeptide and/or 
polynucleotide, or complexes thereof; (b) deducing the three-dimensional structure for the likely 
reactive site(s), binding site(s) or motif(s) of an agonist, antagonist or inhibitor; (c) synthesizing 
candidate compounds that are predicted to bind to or react with the deduced binding site(s), 
reactive site(s), and/or motif(s); and (d) testing whether the candidate compounds are indeed 
agonists, antagonists or inhibitors. It will be further appreciated that this will normally be an 
iterative process, and this iterative process may be performed using automated and computer- 
controlled steps. 

Each of the polynucleotide sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be used as a 
target for the screening of antibacterial drugs. Additionally, the polynucleotide sequences 
encoding the amino terminal regions of the encoded protein or Shine-Dalgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
sequences to control the expression of the coding sequence of interest. 
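
As a simple illustration of constructing an antisense sequence, the reverse complement of a chosen region of the coding strand may be computed as sketched below. The example sequence is hypothetical (it merely contains a Shine-Dalgarno-like AGGAGG motif upstream of an ATG) and is not a sequence of the invention; the function names are likewise assumptions.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_to_region(coding_strand, start, length):
    """Antisense DNA oligonucleotide, written 5'->3', complementary to
    coding_strand[start:start + length] (0-based start)."""
    region = coding_strand[start:start + length]
    if len(region) < length:
        raise ValueError("region extends beyond the end of the sequence")
    return reverse_complement(region)


# Hypothetical 5' region: Shine-Dalgarno-like motif, spacer, start codon.
example = "AGGAGGTTTAAAATGAAACGTCTG"
start_codon_index = example.index("ATG")
print(antisense_to_region(example, start_codon_index, 9))
```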

The invention also provides the use of the polypeptide, polynucleotide, agonist or 
antagonist of the invention to interfere with the initial physical interaction between a pathogen or 
pathogens and a eukaryotic, preferably mammalian, host that is responsible for sequelae of 
infection. In particular, the molecules of the invention may be used: in the prevention of 
adhesion of bacteria, in particular Gram positive and/or Gram negative bacteria, to eukaryotic, 
preferably mammalian, extracellular matrix proteins on in-dwelling devices or to extracellular 
matrix proteins in wounds; to block bacterial adhesion between eukaryotic, preferably 
mammalian, extracellular matrix proteins and bacterial DnaI proteins that mediate tissue damage; 
and/or to block the normal progression of pathogenesis in infections initiated other than by the 
implantation of in-dwelling devices or by other surgical techniques. 

In accordance with yet another aspect of the invention, there are provided dnaI 
antagonists, preferably bacteriostatic or bactericidal antagonists. 

The antagonists of the invention may be employed, for instance, to prevent, inhibit and/or 
treat diseases. 

Compositions, kits and administration 

In a further aspect of the invention there are provided compositions comprising a dnaI 
polynucleotide and/or a S. aureus DnaI polypeptide for administration to a cell or to a 
multicellular organism. 

The present invention provides for pharmaceutical compositions comprising a 
therapeutically effective amount of a polypeptide and/or polynucleotide, such as the soluble form 
of a polypeptide and/or polynucleotide of the present invention, antagonist peptide or small 
molecule compound, in combination with a pharmaceutically acceptable carrier or excipient. 
Such carriers include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol, and combinations thereof. The pharmaceutical compositions may be administered in 
any effective, convenient manner including, for instance, administration by topical, oral, anal, 
vaginal, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal 
routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as 
an injectable composition, for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application for example in 
the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, mouthwash, 
impregnated dressings and sutures and aerosols, and may contain appropriate conventional 
additives, including, for example, preservatives, solvents to assist drug penetration, and 
emollients in ointments and creams. Such topical formulations may also contain compatible 
conventional carriers, for example cream or ointment bases, and ethanol or oleyl alcohol for 
lotions. Such carriers may constitute from about 1% to about 98% by weight of the formulation; 
more usually they will constitute up to about 80% by weight of the formulation. Alternative 
means for systemic administration include transmucosal and transdermal administration using 
penetrants such as bile salts or fusidic acids or other detergents. In addition, if a polypeptide or 
other compounds of the present invention can be formulated in an enteric or an encapsulated 
formulation, oral administration may also be possible. Administration of these compounds may 
also be topical and/or localized, in the form of salves, pastes, gels, and the like. 

For administration to mammals, and particularly humans, it is expected that the daily 
dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 mg/kg. 
The physician in any event will determine the actual dosage which will be most suitable for an 
individual and will vary with the age, weight and response of the particular individual. The 
above dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this invention. 

As used herein, the term "in-dwelling device" refers to surgical implants, prosthetic 
devices and catheters, i.e., devices that are introduced to the body of an individual and remain in 
position for an extended time. Such devices include, but are not limited to, artificial joints, heart 
valves, pacemakers, vascular grafts, vascular catheters, cerebrospinal fluid shunts, urinary 
catheters, and continuous ambulatory peritoneal dialysis (CAPD) catheters. 

The composition of the invention may be administered by injection to achieve a systemic 
effect against relevant bacteria shortly before insertion of an in-dwelling device. Treatment may 
be continued after surgery during the in-body time of the device. In addition, the composition 
could also be used to broaden perioperative cover for any surgical technique to prevent bacterial 
wound infections, especially S. aureus wound infections. 

Many orthopedic surgeons consider that humans with prosthetic joints should be 
considered for antibiotic prophylaxis before dental treatment that could produce a bacteremia. 
Deep infection is a serious complication sometimes leading to loss of the prosthetic joint and is 
accompanied by significant morbidity and mortality. It may therefore be possible to extend the 
use of the active agent as a replacement for prophylactic antibiotics in this situation. 

In addition to the therapy described above, the compositions of this invention may be 
used generally as a wound treatment agent to prevent adhesion of bacteria to matrix proteins 
exposed in wound tissue and for prophylactic use in dental treatment as an alternative to, or in 
conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling 
device immediately before insertion. The active agent will preferably be present at a 
concentration of 1 mg/ml to 10 mg/ml for bathing of wounds or indwelling devices. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may 
be employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 
microgram/kg of antigen, and such dose is preferably administered 1-3 times and with an interval 
of 1-3 weeks. With the indicated dose range, no adverse toxicological effects will be observed 
with the compounds of the invention which would preclude their administration to suitable 
individuals. 

Sequence Databases, Sequences in a Tangible Medium, and Algorithms 

Polynucleotide and polypeptide sequences form a valuable information resource with 
which to determine their 2- and 3-dimensional structures as well as to identify further sequences 
of similar homology. These approaches are most easily facilitated by storing the sequence in a 
computer readable medium and then using the stored data in a known macromolecular structure 
program or to search a sequence database using well known searching tools, such as GCG. 

The polynucleotide and polypeptide sequences of the invention are particularly useful as 
components in databases useful for search analyses as well as in sequence analysis algorithms. 
As used in this section entitled "Sequence Databases, Sequences in a Tangible Medium, and 
Algorithms," and in claims related to this section, the terms "polynucleotide of the invention" 
and "polynucleotide sequence of the invention" mean any detectable chemical or physical 
characteristic of a polynucleotide of the invention that is or may be reduced to or stored in a 
tangible medium, preferably a computer readable form. For example, chromatographic scan data 
or peak data, photographic data or scan data therefrom, called bases, and mass spectrographic 
data. As used in this section entitled Databases and Algorithms and in claims related thereto, the 
terms "polypeptide of the invention" and "polypeptide sequence of the invention" mean any 
detectable chemical or physical characteristic of a polypeptide of the invention that is or may be 
reduced to or stored in a tangible medium, preferably a computer readable form. For example, 
chromatographic scan data or peak data, photographic data or scan data therefrom, and mass 
spectrographic data. 

The invention provides a computer readable medium having stored thereon polypeptide 
sequences of the invention and/or polynucleotide sequences of the invention. The computer 
readable medium can be any composition of matter used to store information or data, including, 
for example, commercially available floppy disks, tapes, chips, hard drives, compact disks, and 
video disks. 
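
By way of example only, sequences may be stored in, and retrieved from, a computer readable form using the widely used FASTA text format, and the stored data may then be searched. The sketch below is one possible arrangement; the record names, the short sequences and the exact-match search are assumptions for illustration and do not represent any particular database tool.

```python
import io


def write_fasta(records, handle, width=60):
    """Write (name, sequence) pairs to a text handle in FASTA format."""
    for name, seq in records:
        handle.write(f">{name}\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")


def read_fasta(handle):
    """Read FASTA text back into a list of (name, sequence) pairs."""
    records, name, chunks = [], None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def find_motif(records, motif):
    """Return (name, 0-based index of first match) for records containing motif."""
    return [(name, seq.find(motif)) for name, seq in records if motif in seq]


# Hypothetical records stored in an in-memory "medium" for demonstration.
medium = io.StringIO()
write_fasta([("example_nt", "ACGT" * 20), ("example_aa", "MKT")], medium)
medium.seek(0)
print(find_motif(read_fasta(medium), "GTAC"))
```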

In a preferred embodiment of the invention there is provided a computer readable 
medium having stored thereon a member selected from the group consisting of: a polynucleotide 
comprising the sequence of SEQ ID NO: 1 or SEQ ID NO: 17; a polypeptide comprising the 
sequence of SEQ ID NO: 2 or SEQ ID NO: 16; a set of polynucleotide sequences wherein at 
least one of said sequences comprises the sequence of SEQ ID NO: 1 or SEQ ID NO: 17; a set of 
polypeptide sequences wherein at least one of said sequences comprises the sequence of SEQ ID 
NO: 2 or SEQ ID NO: 16; a data set representing a polynucleotide sequence comprising the 
sequence of SEQ ID NO: 1 or SEQ ID NO: 17; a data set representing a polynucleotide sequence 
encoding a polypeptide sequence comprising the sequence of SEQ ID NO: 2 or SEQ ID NO: 16; 
a polynucleotide comprising the sequence of SEQ ID NO: 1 or SEQ ID NO: 17; a polypeptide 
comprising the sequence of SEQ ID NO: 2 or SEQ ID NO: 16; a set of polynucleotide sequences 
wherein at least one of said sequences comprises the sequence of SEQ ID NO: 1 or SEQ ID NO: 
17; a set of polypeptide sequences wherein at least one of said sequences comprises the sequence 
of SEQ ID NO: 2 or SEQ ID NO: 16; a data set representing a polynucleotide sequence 
comprising the sequence of SEQ ID NO: 1 or SEQ ID NO: 17; a data set representing a 
polynucleotide sequence encoding a polypeptide sequence comprising the sequence of SEQ ID 
NO: 2 or SEQ ID NO: 16. 

All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their entirety as if 
each individual publication or reference were specifically and individually indicated to be 
incorporated by reference herein as being fully set forth. Any patent application to which this 
application claims priority is also incorporated by reference herein in its entirety in the manner 
described above for publications and references. 

EXAMPLES 

Example 1 

Identification of the inhibitory ORF 104 from Staphylococcus aureus bacteriophage 77. 



The S. aureus propagating strain 77 (PS 77) was used as a host to propagate its respective 
phage 77 (ATCC #27699-B1). The phage was propagated using the agar layer method described 
by Swanström and Adams (Swanström et al. (1951) Proc. Soc. Exptl. Biol. & Med. 78: 372-375). 
Phage DNA was prepared from the purified phages as described in Sambrook et al. (1989) 
Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY. Blunt-ended sonicated phage DNA fragments were cloned into the 
pKSII vector (Stratagene). Recombinant clones were sequenced on an ABI 377-36 automated 
sequencer. To ensure co-linearity of the sequence data and the genome, all regions of the phage 
genome were sequenced at least once from both directions on two separate clones. Sequence 
contigs were assembled using Sequencher 3.1 software (GeneCodes) (Fig. 2). An 
implementation of the publicly available program SEQUIN, available for download at 
ftp://ncbi.nlm.nih.gov/sequin/, was used on phage genome sequence to identify all putative ORFs 
larger than 33 codons (Fig. 3). 
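
The kind of scan used to list putative ORFs above a size threshold can be illustrated with the minimal sketch below. It is not the SEQUIN implementation; it assumes ATG as the only start codon, the three standard stop codons, and a six-frame search, and it reports only ORFs that are closed by a stop codon. These simplifications and the function name are assumptions made for the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_orfs(dna, min_codons=34, start_codons=("ATG",)):
    """Return (strand, frame, start, end, n_codons) for each ORF with at
    least `min_codons` codons (stop codon not counted).

    `start` and `end` are 0-based, end-exclusive coordinates on the strand
    that was scanned; `end` includes the stop codon. The default of 34
    corresponds to "larger than 33 codons".
    """
    dna = dna.upper()
    results = []
    strands = (("+", dna), ("-", dna.translate(_COMPLEMENT)[::-1]))
    for strand, seq in strands:
        for frame in range(3):
            orf_start = None
            for pos in range(frame, len(seq) - 2, 3):
                codon = seq[pos:pos + 3]
                if orf_start is None and codon in start_codons:
                    orf_start = pos  # first in-frame start opens the ORF
                elif orf_start is not None and codon in STOP_CODONS:
                    n_codons = (pos - orf_start) // 3
                    if n_codons >= min_codons:
                        results.append((strand, frame, orf_start, pos + 3, n_codons))
                    orf_start = None
    return results
```

Bacterial and phage genes may also begin with alternative start codons, which can be supplied through the `start_codons` parameter.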



The 77ORF104 (SEQ ID NO: 4) was amplified by polymerase chain reaction (PCR) from 
phage genomic DNA (Fig. 4). For PCR amplification, the sense strand primer starts at the 
initiation codon and is preceded by a BamHI restriction site; the antisense strand primer starts at 
the last codon (excluding the stop codon) and is preceded by a SalI restriction site. The PCR 
product was gel purified and digested with BamHI and SalI. The digested PCR product was then 
ligated into BamHI- and SalI-digested pT vector (Fig. 7A), and used to transform S. aureus strain 
RN4220 (Kreiswirth et al. (1983) Nature 305: 709-712). Selection of recombinant clones was 
performed on Luria-Bertani (LB) agar plates containing 30 µg/ml of kanamycin. 
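
The primer layout described above may be sketched as follows. The recognition sequences of BamHI (GGATCC) and SalI (GTCGAC) are standard; the annealing length, the optional 5' clamp and the short example ORF are hypothetical choices for illustration and are not the primers actually used.

```python
BAMHI_SITE = "GGATCC"
SALI_SITE = "GTCGAC"
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    return dna.translate(_COMPLEMENT)[::-1]


def design_primers(orf, anneal_len=18, clamp=""):
    """Return (sense, antisense) primers, each written 5'->3'.

    The sense primer is a BamHI site followed by the first `anneal_len`
    bases from the initiation codon. The antisense primer is a SalI site
    followed by the reverse complement of the last `anneal_len` bases
    before the stop codon. `clamp` is an optional 5' extension.
    """
    orf = orf.upper()
    if len(orf) % 3 != 0 or orf[-3:] not in STOP_CODONS:
        raise ValueError("expected a complete ORF ending in a stop codon")
    body = orf[:-3]  # coding sequence without the stop codon
    if len(body) < anneal_len:
        raise ValueError("ORF is shorter than the annealing length")
    sense = clamp + BAMHI_SITE + body[:anneal_len]
    antisense = clamp + SALI_SITE + reverse_complement(body[-anneal_len:])
    return sense, antisense


# Hypothetical 11-codon ORF for demonstration (not 77ORF104).
example_orf = "ATGAAACGTCTGGAAGCTATTAAAGGTCAGTAA"
print(design_primers(example_orf, anneal_len=12))
```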

Sodium arsenite (NaAsO2) was used to induce gene expression from the ars 
promoter/operator. The effect of expression of phage 77 ORFs on bacterial cell growth was then 
evaluated in functional assays on solid medium and in liquid medium. As shown in Fig. 7B, the 
induction of expression of phage 77ORF104 by plating transformants on semi-solid medium 
containing 5 µM sodium arsenite results in the inhibition of bacterial growth on solid medium 
compared to plating in the absence of inducer or plating of control non-inhibitory ORF (phage 77 
ORF 19) transformants. As shown in Fig. 7C, the density of the culture, as assessed by colony 
forming units (CFU), for S. aureus clones harboring the 77ORF104 increased over time under 
non-induced conditions. Similar growth rates were also observed with transformants harboring a 
non-inhibitory ORF (labeled as 'non killer' on the graphs) under both induced and non-induced 
conditions. At 4 h following induction, the expression of 77ORF104 is cytocidal, resulting in a 
0.5 log reduction in the number of CFU compared to the number of CFU initially present in the 
same culture. 
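
For clarity, a log reduction in CFU is the base-10 logarithm of the ratio of initial to final counts, so that a 0.5 log reduction corresponds to roughly one third of the initial CFU surviving. A minimal sketch of this arithmetic follows; the function names are assumptions for illustration.

```python
import math


def log_reduction(initial_cfu, final_cfu):
    """log10(initial / final); positive values indicate killing."""
    if initial_cfu <= 0 or final_cfu <= 0:
        raise ValueError("CFU counts must be positive")
    return math.log10(initial_cfu / final_cfu)


def surviving_fraction(log_red):
    """Fraction of the initial CFU remaining after a given log reduction."""
    return 10 ** (-log_red)


print(round(surviving_fraction(0.5), 3))
```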

Example 2 

Identification of a S. aureus protein targeted by bacteriophage 77 ORF 104. 

To identify S. aureus proteins that interact with the bacterial growth inhibitory 
Staphylococcus bacteriophage 77 ORF 104, a GST-fusion of ORF 104 was generated and the 
recombinant protein purified and utilized to make a GST/ORF104 affinity column. Cellular 
extracts prepared from S. aureus cells were incubated with the affinity matrix, washed with 
increasing salt concentrations and different detergents, and the protein elution profile of the 
washes assessed by SDS-polyacrylamide gel electrophoresis. A protein of molecular mass ~40 
kDa was specifically eluted from the affinity matrix. Eluted proteins were further characterized 
to determine the identity of the interacting protein and to validate the interaction of the protein 
with 77ORF104 as described in detail below. 

A. Generation of GST/ORF104 recombinant protein. 

Bacteriophage 77ORF104 was sub-cloned into pGEX 4T-1 (Pharmacia), an expression 
vector containing the GST moiety. The gene encoding ORF104 was obtained by digestion of 
pT/ORF104 (Fig. 7A) with BamHI and SalI. The DNA fragment containing ORF104 was gel 
purified by QiaQuick spin column (Qiagen) and ligated into pGEX 4T-1 (which had been 
previously digested with BamHI and SalI) to generate pGEX 4T/ORF104. Recombinant 
expression vectors were identified by restriction enzyme analysis of plasmid minipreps; 
large-scale DNA preparations were performed with Qiagen columns, and the resulting plasmid was 
sequenced. Test expressions in E. coli DH5 cells containing the expression plasmids were 
performed to identify optimal protein expression conditions. E. coli DH5 cells containing the 
pGEX 4T/ORF104 were grown in Luria-Bertani Broth at 37°C to an OD600 of 0.4 to 0.6 and 
induced with 1 mM IPTG at 30°C for 4 hrs. 

B. Fusion protein purification. 

Cells containing GST/ORF104 fusion protein were suspended in GST lysis buffer (20 mM
Hepes pH 7.2, 500 mM NaCl, 10% glycerol, 1 mM DTT, 1 mM EDTA, 1 mM benzamidine, and
1 mM PMSF) at 20 ml of lysis buffer per liter of cell culture and lysed by French Pressure cell
followed by three bursts of twenty seconds with an ultra-sonicator at 4°C. Triton X-100 was
added to the lysate to a final concentration of 0.1% and mixed for 30 minutes at 4°C. The lysate
was centrifuged at 4°C for 30 minutes at 10 000 rpm in a Sorvall SS34 rotor. The supernatant
was applied to a 4 ml glutathione sepharose column pre-equilibrated with lysis buffer and
allowed to flow by gravity. The column was washed with 10 column volumes of lysis buffer and
eluted in 1.5 ml fractions with GST elution buffer (20 mM Hepes pH 8.0, 500 mM NaCl, 10%
glycerol, 1 mM DTT, 0.1 mM EDTA, and 25 mM reduced glutathione). The fractions were
analyzed by 12.5% SDS-PAGE (Laemmli) and visualized by staining with Coomassie Brilliant
Blue R250 stain to assess the amount of eluted GST/ORF104 protein.
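
The lysis buffer recipe above can be converted into pipetting volumes with the dilution
relation C1 x V1 = C2 x V2. The following is a minimal sketch only: the stock concentrations
are assumed values that do not come from this document, the PMSF concentration is read as
1 mM, and the helper names `stock_volumes_ml` and `lysis_buffer_ml` are hypothetical.

```python
# Final concentrations of the GST lysis buffer as given in the text.
# Molar components are in mM; glycerol is in percent (v/v).
LYSIS_BUFFER = {
    "Hepes pH 7.2": 20.0,
    "NaCl": 500.0,
    "glycerol": 10.0,
    "DTT": 1.0,
    "EDTA": 1.0,
    "benzamidine": 1.0,
    "PMSF": 1.0,  # assumed to be 1 mM
}

# ASSUMED stock concentrations (same units as above); not from the source.
ASSUMED_STOCKS = {
    "Hepes pH 7.2": 1000.0,
    "NaCl": 5000.0,
    "glycerol": 100.0,
    "DTT": 1000.0,
    "EDTA": 500.0,
    "benzamidine": 1000.0,
    "PMSF": 100.0,
}


def lysis_buffer_ml(culture_liters, ml_per_liter=20.0):
    """Buffer volume needed: the text uses 20 ml of buffer per liter of culture."""
    return culture_liters * ml_per_liter


def stock_volumes_ml(final, stocks, total_ml):
    """Volume of each stock (ml) for total_ml of buffer, using C1*V1 = C2*V2."""
    volumes = {name: total_ml * conc / stocks[name] for name, conc in final.items()}
    # Whatever volume is left over is made up with water.
    volumes["water"] = total_ml - sum(volumes.values())
    return volumes


if __name__ == "__main__":
    total = lysis_buffer_ml(1.0)  # one liter of culture
    for name, ml in stock_volumes_ml(LYSIS_BUFFER, ASSUMED_STOCKS, total).items():
        print(f"{name:>14}: {ml:6.2f} ml")
```

With different stock solutions, only the `ASSUMED_STOCKS` table needs to change.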

GST/ORF104 (12 mg) was dialyzed overnight against 20 mM Hepes pH 7.5, 150 mM
NaCl, 10% glycerol, 1 mM DTT, 0.1 mM EDTA, made up to 2.5 mM CaCl2 and digested with
bovine thrombin at a mass ratio of 1:10 (thrombin:GST/ORF104) for 2.5 hrs at 28°C to cleave
the GST domain from the ORF104 domain. The digestion was stopped by the addition of 1 mM
PMSF, 1 mM benzamidine and NaCl to a 1 M final concentration. The digested protein was
applied to a one ml glutathione sepharose column and flow-through fractions of 1 ml were
collected. The fractions were analyzed by 12% SDS-PAGE (Tricine) and visualized by staining
with Coomassie Brilliant Blue R250 stain to determine which fractions contained bacterially
expressed ORF104 lacking the GST tag.
C. Affinity column preparation.

GST and GST/ORF104 fusion protein were dialyzed overnight against Affinity
Chromatography Buffer (ACB; 20 mM Hepes pH 7.5, 10% glycerol, 1 mM DTT, and 1 mM
EDTA) containing 1 M NaCl. ORF104 protein obtained from thrombin digestion of
GST/ORF104 was used without dialysis. Protein concentrations were determined by Bio-Rad
Protein Assay, and the proteins were crosslinked to Affigel 10 resin (Bio-Rad) at protein/resin
concentrations of 0, 0.1, 0.5, 1.0, and 2.0 mg/ml. The crosslinked resin was sequentially
incubated in the presence of ethanolamine and bovine serum albumin (BSA) prior to column
packing and equilibration with ACB containing 75 mM NaCl.

D. S. aureus extract preparation.

Two extracts were prepared from S. aureus cell pellets. One lysate was prepared by
French Pressure cell followed by sonication and the other by lysostaphin digestion followed by
sonication. The French Pressure cell lysate was prepared by suspending 3 g of frozen S.
aureus cells in ACB containing 500 mM NaCl, 1 mM PMSF, and 1 mM benzamidine. The
suspended cells were subjected to three passes through the French Pressure cell followed by 3
sonication bursts of 20 seconds each, made up to 0.1% Triton X-100, stirred for 30 minutes, and
centrifuged at 50 000 rpm for 3 hrs in a Ti70 fixed angle Beckman rotor. The efficiency of cell
lysis was low and the resulting lysate (7 ml) contained 2.4 mg/ml protein. The cell pellet
remaining after the French Pressure cell lysis was subjected to cryogenic grinding in liquid
nitrogen in the same buffer with a mortar and pestle. The lysate was made up to 0.1% Triton
X-100, stirred for 30 minutes, and centrifuged at 50 000 rpm for 3 hrs in a Ti70 fixed angle
Beckman rotor, yielding a lysate (10 ml) containing 2.0 mg/ml protein. The cell lysates were
found to be similar by SDS-PAGE and were pooled, concentrated to 8 ml, and dialyzed overnight
in a 3000 Mr dialysis membrane against ACB containing 1 mM PMSF, 1 mM benzamidine, and
75 mM NaCl. The dialyzed protein extract was removed from the dialysis tubing, centrifuged at
10 000 rpm in a Sorvall SS34 rotor for 1 hr, and assayed for protein content (Bio-Rad Protein
Assay) and salt concentration (conductivity meter).
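
Centrifugation conditions in this protocol are given in rpm for named rotors. To compare
them with other equipment it can help to convert rpm to relative centrifugal force (RCF).
The sketch below uses the standard relation RCF = omega^2 * r / g. The rotor radii are
assumed nominal maximum radii that are not stated in this document and should be checked
against the rotor manuals; the function name `rcf` is hypothetical.

```python
import math

G0 = 9.80665  # standard gravity in m/s^2

# ASSUMED nominal maximum radii in cm; not given in the source text.
ASSUMED_RMAX_CM = {
    "SS34": 10.7,
    "Ti70": 9.19,
}


def rcf(rpm, radius_cm):
    """Relative centrifugal force (multiples of g) at a given speed and radius."""
    omega = 2.0 * math.pi * rpm / 60.0           # angular velocity in rad/s
    return omega ** 2 * (radius_cm / 100.0) / G0  # radius converted to meters


if __name__ == "__main__":
    # Clarification spin after dialysis: 10 000 rpm in the SS34 rotor.
    print(f"SS34 at 10 000 rpm: {rcf(10_000, ASSUMED_RMAX_CM['SS34']):,.0f} x g")
    # Lysate clearing spin: 50 000 rpm in the Ti70 rotor.
    print(f"Ti70 at 50 000 rpm: {rcf(50_000, ASSUMED_RMAX_CM['Ti70']):,.0f} x g")
```

Because RCF scales with the square of the speed, the 50 000 rpm spin is roughly twenty
times more forceful than the 10 000 rpm spin under these assumed radii.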

The second lysate was prepared by lysostaphin digestion followed by sonication. An S.
aureus cell pellet (2.9 g) was suspended in 8 ml of 20 mM Hepes pH 7.5, 150 mM NaCl, 10%
glycerol, 1 mM DTT, 1 mM PMSF, 1 mM benzamidine, and 1000 units of lysostaphin. The cell
suspension was incubated at 37°C for 30 minutes, cooled to 4°C, and made up to a final
concentration of 1 mM EDTA and 500 mM NaCl. The lysate was sonicated on ice using three
bursts of 20 seconds each. The lysate was made up to 0.1% Triton X-100, stirred for 30 minutes,
and centrifuged at 50 000 rpm for 3 hrs in a Ti70 fixed angle Beckman rotor. The supernatant
was removed and dialyzed overnight in a 3000 Mr dialysis membrane against ACB containing
75 mM NaCl, 1 mM benzamidine, and 1 mM PMSF. The dialyzed protein extract was removed
from the dialysis tubing, centrifuged at 10 000 rpm in a Sorvall SS34 rotor for 1 hr, and assayed
for protein content (utilizing the Bio-Rad Protein Assay) and salt concentration (utilizing a
conductivity meter). Aliquots of the extracts were frozen at -70°C.
E. Affinity chromatography.

S. aureus extract (400 µl) was applied to 40 µl columns containing 0, 0.1, 0.5, 1.0, and
2.0 mg/ml ligand, and ACB containing 75 mM NaCl (400 µl) was applied to an additional
column containing 2.0 mg/ml ligand. The columns were washed with ACB containing 75 mM
NaCl (400 µl) and sequentially eluted with ACB containing 1% Triton X-100 and 75 mM NaCl
(160 µl), ACB containing 250 mM NaCl (160 µl), ACB containing 1 M NaCl (160 µl), and 1%
SDS (160 µl). 40 µl of each eluate was resolved by 16 cm 12.5% SDS-PAGE (Laemmli) and the
eluted proteins were visualized by silver stain.

F. Identification of S. aureus DnaI homolog as an ORF104 interacting protein

Proteins at approximately 38 kDa were observed specifically in the eluants from the
GST/ORF104 and ORF104 (GST removed) columns obtained from ACB containing 75 mM
NaCl and 1% Triton X-100, and 1% SDS (Figs. 8-10; eluting protein indicated by an arrow).
These bands were excised from the SDS-PAGE gels and prepared for tryptic peptide mass
determination by MALDI-ToF mass spectrometry (Qin, J., et al. (1997) Anal. Chem. 69, 3995-
4001). High quality mass spectra were obtained (Fig. 11). The candidate proteins observed in
the two eluants were identical as determined by the masses of the tryptic peptides (Fig. 11).
Post-Source Decay (PSD) coupled with Collision-Induced Decay (CID) was used to obtain
fragmentation spectra of tryptic peptides having monoisotopic MH+ masses of 1351.8, 1412.7,
and 1617.8 Da. The fragment masses were used to search all public domain databases, resulting
in no identification. The PSD/CID spectra obtained for the peptide having a monoisotopic MH+
mass of 1412.7 were then interpreted to obtain a peptide sequence GHVPBNVTDNDR (SEQ ID
NO: 10), which was used to BLAST search the S. aureus nucleotide sequence database at
http://www.genome.ou.edu/staph.html . One nucleotide sequence, Contig 981, in reading frame
+3 encoded the similar amino acid sequence GHVPELYVDNNR (SEQ ID NO: 11). This
tentative identification of the candidate protein was then confirmed upon conceptual translation
and in silico tryptic digestion of the open reading frame found in Contig 981. Furthermore, the
obtained PSD/CID spectra for tryptic peptides with monoisotopic MH+ masses of 1351.8 and
1617.8 Da were similar to the predicted PSD/CID fragmentation patterns of the tryptic peptides
with monoisotopic MH+ masses of 1351.8 and 1617.8 Da found in the Contig 981 open reading
frame. Comparison of the Contig 981 open reading frame with all other sequences in the public
domain databases revealed that Contig 981 is a homologue of Bacillus subtilis DnaI, a protein
involved in origin-dependent DNA replication (42% identity and 62% similarity) (Table 1).
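
The in silico tryptic digestion and peptide mass matching described above can be sketched
as follows. This is an illustration, not the software actually used: the protein string is
SEQ ID NO: 2 as read from this document's sequence listing with apparent OCR errors
corrected against SEQ ID NO: 1 (an assumption), the cleavage rule is the common
"after K or R unless followed by P" convention, and the function names are hypothetical.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # mass of H2O added to the residue sum
PROTON = 1.00728   # mass of the proton in the singly charged MH+ ion

# ASSUMPTION: SEQ ID NO: 2 transcribed from the listing with OCR errors corrected.
DNAI = (
    "MGGGQSIMKQFKSIINTSQDFEKRIEKIKKEVINDPDVKQFLEAHRAELTNAMIDEDLNV"
    "LQEYKDQQKHYDGHKFADCPNFVKGHVPELYVDNNRIKIRYLQCPCKIKYDEERFEAELI"
    "TSHHMQRDTLNAKLKDIYMNHRDRLDVAMAADDICTAITNGEQVKGLYLYGPFGTGKSFI"
    "LGAIANQLKSKKVRSTIIYLPEFIRTLKGGFKDGSFEKKLHRVREANILMLDDIGAEEVT"
    "PWVRDEVIGPLLHYRMVHELPTFFSSNFDYSELEHHLAMTRDGEEKTKAARIIERVKSLS"
    "TPYFLSGENFRNN"
)


def tryptic_digest(seq):
    """Cleave C-terminal to K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        nxt = seq[i + 1] if i + 1 < len(seq) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])  # C-terminal peptide without K/R
    return peptides


def mh_plus(peptide):
    """Monoisotopic mass of the singly protonated peptide (MH+)."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def find_by_mass(peptides, observed, tol=0.2):
    """Peptides whose computed MH+ lies within tol Da of the observed mass."""
    return [p for p in peptides if abs(mh_plus(p) - observed) <= tol]


if __name__ == "__main__":
    peptides = tryptic_digest(DNAI)
    for observed in (1351.8, 1412.7, 1617.8):
        for pep in find_by_mass(peptides, observed):
            print(f"{observed:7.1f}  {pep}  (computed {mh_plus(pep):.2f})")
```

Under these assumptions, each of the three observed MH+ masses is matched by a predicted
tryptic peptide, including GHVPELYVDNNR for the 1412.7 Da ion.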
G. Yeast two-hybrid confirmation of DnaI and ORF104 interaction.

To validate the identification of the S. aureus dnaI homolog as an interacting partner of
bacteriophage 77ORF104, the interaction was assessed in vivo in the yeast two-hybrid system.
As shown in Fig. 12B, bacteriophage 77ORF104 was fused to the carboxyl terminus of either the
yeast Gal4 DNA binding domain (pGBKT7, Clontech Laboratories) or the Gal4 activation
domain (pGADT7, Clontech Laboratories), yielding pGBK77ORF104 and pGAD77ORF104,
respectively. The polynucleotide sequence of the DnaI homologue was obtained from S. aureus
genomic DNA by PCR utilizing oligonucleotide primers that targeted the translation initiation
and termination codons of the dnaI gene (SEQ ID NO: 1). As shown in Fig. 12A, the sense
strand primer targets the initiation codon and is preceded by an EcoRI restriction site
(5'-gaattc-3'); the antisense oligonucleotide targets the stop codon and is preceded by a BamHI
restriction site (5'-ggatcc-3'). The PCR product was purified using the Qiagen PCR purification
kit and digested with EcoRI and BamHI. The digested PCR product was ligated to EcoRI- and
BamHI-digested pGADT7 vector (pGAD dnaI). A similar strategy was used for the cloning of
DnaI into pGBKT7 vector (pGBK dnaI).

As shown in Fig. 12D, the pGAD and pGBK plasmids bearing different combinations of
constructs (as indicated in NO 1 to 6) were introduced into a yeast strain (AH109, Clontech
Laboratories), previously engineered to contain chromosomally-integrated copies of E. coli lacZ
and the selectable HIS3 and ADE2 genes. Co-transformants were plated in parallel on yeast
synthetic medium (SD) supplemented with amino acid drop-out lacking tryptophan and leucine
(TL minus) and on SD supplemented with amino acid drop-out lacking tryptophan, histidine,
adenine and leucine (THAL minus). Co-transformants harboring the 77ORF104 polypeptide
only grew on selective THAL minus media in the presence of DnaI (right Petri, NO 5 and 6).
Induction of the reporter HIS3 and ADE2 genes is dependent upon the interaction of DnaI with
77ORF104 proteins since when either plasmid is introduced into yeast host cells with the control
plasmid (pGBKT7-53 or pGADT7-T), no reporter expression is observed (NO 2 and 3).
pGADT7-T and pGBKT7-53 are positive controls for protein:protein interaction (NO 1) and
pCL1 encodes an active Gal4 transcription factor (NO 4). Interaction of DnaI and 77ORF104 is
also demonstrated by the presence of luminescent β-galactosidase activity in 77ORF104-DnaI
co-transformants (Fig. 12E: NO 5 and 6). These results are consistent with the interpretation
that the S. aureus DnaI homologue identified herein is the host target of bacteriophage 77ORF104.

Example 3

Identification of the surface of interaction on S. aureus DnaI.

To identify the specific domain of S. aureus DnaI which participates in the interaction
with bacteriophage 77ORF104, recombinant DnaI protein was subjected to partial proteolytic
digestion and applied to an affinity column containing 77ORF104. Partial proteolytic fragments
of DnaI interacting with 77ORF104 were then analyzed by SDS-PAGE and mass
spectrometry, and subsequently characterized by yeast two-hybrid assay to validate the
interaction of the DnaI sub-fragment with 77ORF104 as described in detail below.
A. Sub-cloning of DnaI into a bacterial inducible expression system

Full-length DnaI was amplified from S. aureus genomic DNA using the polymerase
chain reaction (PCR). For PCR amplification of DnaI, the sense strand primer targets the
initiation codon and is preceded by a BamHI restriction site (5'-ggatcc-3'); the antisense
oligonucleotide targets the stop codon and is preceded by a SalI restriction site (5'-gtcgac-3')
(SEQ ID NO: 1). The digested PCR product was purified using the Qiagen PCR purification kit,
ligated into BamHI- and SalI-digested pGEX-6P-1 vector (# 27-4597, Amersham Pharmacia
Biotech), and used to transform E. coli strain BL21. The sequence integrity of DnaI polypeptides
fused to GST was verified directly by DNA sequencing.

Expression of the GST-DnaI recombinant protein from the plasmid pGEX-6P-1-DnaI
was induced by the addition of 0.5 mM IPTG to a 6 liter culture at an OD600 of ~0.5. The
protein was expressed at 30°C for 3 h, and the cells were harvested by centrifugation and stored
as a cell pellet at -70°C. The frozen cell pellet was thawed, resuspended in Buffer 1 (20 mM
HEPES pH 7.3, 500 mM NaCl, 10% glycerol, 1 mM DTT, and 1 mM EDTA) containing 1 mM
PMSF and 1 mM benzamidine, and lysed in a French pressure cell followed by three sonication
bursts of 20 seconds each at 4°C. The cell lysate was centrifuged at 4°C for 30 minutes at
10 000 rpm. The supernatant was applied to a 6 ml glutathione sepharose column equilibrated
with Buffer 1, washed with 60 ml of Buffer 1 containing 1 mM PMSF and 1 mM benzamidine,
and eluted in 6 ml fractions with Buffer 1 containing 50 mM reduced glutathione. Fractions were
analyzed by 12% SDS-PAGE and visualized by Coomassie Brilliant Blue R-250 staining.
B. Cleavage and removal of GST fusion and partial proteolysis of DnaI

Elution fraction 5 containing 7.0 mg GST-DnaI was dialyzed against Buffer 2 (20 mM
HEPES pH 7.5, 150 mM NaCl, 10% glycerol, and 1 mM DTT) and subjected to digestion with
40 Units PreScission protease (Amersham Pharmacia Biotech) at 25°C for 4 hrs. The digested
GST-DnaI was applied to a 1 ml glutathione sepharose column equilibrated with Buffer 2, the
flow-through was collected, and bound protein was eluted with Buffer 1 containing 25 mM
reduced glutathione. Fractions were analyzed by 12% SDS-PAGE and visualized by Coomassie
Brilliant Blue R-250 staining.

The flow-through fraction, containing DnaI, was dialyzed against Buffer 2 and subjected
to proteolytic digestion in reactions containing a protease/DnaI mass ratio of 1:500 (w/w) for
chymotrypsin or 1:50 (w/w) for endoproteinase Glu-C for 2 h at room temperature. The
proteolytic digestion was stopped by the addition of 1 mM PMSF and 1 mM benzamidine and
analyzed by SDS-PAGE (one tenth of the reaction used for analysis). The partial proteolysis
products obtained from chymotrypsin and endoproteinase Glu-C digestion were used for affinity
chromatography.

C. Affinity chromatography between immobilized 77ORF104 and DnaI proteolytic fragments

77ORF104 protein was cross-linked to Affigel 10 (BioRad) followed by blocking of the
remaining active sites with ethanolamine and the non-specific sites with BSA. The columns
were equilibrated with ACB containing 1 M NaCl, and ACB containing 100 mM NaCl. The
partial proteolytic digests were diluted to a final volume of 120 µl with ACB containing 100 mM
NaCl, and purified BSA was added to a final concentration of 0.1 mg/ml. The partial proteolytic
reaction was split into three fractions, of which 50 µl was applied to a column containing
77ORF104 crosslinked at 2.0 mg/ml, 50 µl was applied to a column containing no ligand, and 10
µl was retained for SDS-PAGE. The columns were washed with 10 column volumes of ACB
containing 100 mM NaCl, 4 column volumes of ACB containing 100 mM NaCl and 1% Triton
X-100, and eluted sequentially with 4 column volumes of ACB containing 1 M NaCl and 1% SDS.
The flow-through and eluates were precipitated with trichloroacetic acid (TCA) and
washed with an equal volume of cold (-70°C) acetone. The TCA-precipitated samples were
subjected to 15% SDS-PAGE, and the protein visualized by silver staining (Fig. 14 A and B).

D. Identification of DnaI partial proteolytic fragments interacting with 77ORF104

The interacting proteolytic fragments were excised, digested by trypsin, and analyzed by
mass spectrometry. The peptides contained within each of the interacting proteolytic fragments
were analyzed by MALDI-ToF mass spectrometry, resulting in the determination of the general
region of DnaI for each partial proteolytic peptide. The amino and carboxy terminal ends of the
partial proteolytic fragments were determined for several fragments by the acquisition of mass
spectrometry data of the unfractionated proteolytic digest followed by mapping the observed
mass onto the full length DnaI sequence. Partial proteolytic DnaI fragments interacting with
77ORF104 are presented in Fig. 14C.
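
Mapping an observed fragment mass onto a protein sequence amounts to finding the
contiguous stretches of residues whose summed mass matches the measurement. The sketch
below shows one simple way to do this with prefix sums. It is an illustration under stated
assumptions, not the procedure actually used: monoisotopic masses are used, the 20-residue
demonstration segment is taken as residues 81-100 of SEQ ID NO: 2, and `map_mass` is a
hypothetical helper. A real search would also restrict the fragment ends to sites allowed by
the protease used.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # H2O added to the residue sum to give the neutral fragment mass


def map_mass(seq, observed_mass, tol=0.5, offset=1):
    """Return (first_residue, last_residue, subsequence) for every contiguous
    window of seq whose neutral mass is within tol Da of observed_mass.
    offset is the residue number of seq[0] in the full-length protein."""
    # prefix[k] holds the summed residue mass of seq[:k]
    prefix = [0.0]
    for aa in seq:
        prefix.append(prefix[-1] + MONO[aa])

    hits = []
    for i in range(len(seq)):
        for j in range(i + 1, len(seq) + 1):
            mass = prefix[j] - prefix[i] + WATER
            if abs(mass - observed_mass) <= tol:
                hits.append((i + offset, j + offset - 1, seq[i:j]))
    return hits


if __name__ == "__main__":
    # ASSUMED demonstration segment: residues 81-100 of SEQ ID NO: 2.
    segment = "NFVKGHVPELYVDNNRIKIR"
    # Hypothetical observed neutral mass, chosen to match one window.
    for first, last, sub in map_mass(segment, 1411.68, tol=0.05, offset=81):
        print(f"residues {first}-{last}: {sub}")
```

The double loop is quadratic in sequence length, which is adequate for a protein of a few
hundred residues.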

E. Sub-cloning of DnaI fragments into yeast inducible expression system

The interaction between 77ORF104 and portions of the DnaI polypeptide was also assessed in
vivo in the yeast two-hybrid system. Two portions of the polynucleotide sequence of DnaI were
amplified by PCR from S. aureus genomic DNA by utilizing appropriate pairs of
oligonucleotides (Fig. 15). The portion extending from amino acid residues 64 to 313 was
obtained with the two following oligonucleotides: the sense strand (with an EcoRI cloning site)
5'-ccggaattcTATAAAGATCAACAAAAAC-3', SEQ ID NO: 12, and the antisense strand (with
a BamHI cloning site) 5'-cgcggatccTCAATTGTTTCTGAAATT-3', SEQ ID NO: 13.
The polynucleotide sequence encoding amino acids 150-313 of SEQ ID NO: 2 corresponds to
nucleotides 448 to 942 of SEQ ID NO: 1 and is herein designated SEQ ID NO: 17. The portion
extending from amino acid residues 150 to 313 was obtained with the two following
oligonucleotides: the sense strand (with an EcoRI cloning site)
5'-ccggaattcGCAGCAGATGATATTTGT-3', SEQ ID NO: 14, and the antisense strand (with a
BamHI cloning site) 5'-cgcggatccTCAATTGTTTCTGAAATT-3', SEQ ID NO: 15. The
digested PCR products were gel purified, ligated into EcoRI- and BamHI-digested pGADT7 prey
vector, and used to transform E. coli strain DH10β. The sequence integrity of the cloned
products was verified directly by DNA sequencing.
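
The relationship between the amino acid ranges, the nucleotide coordinates, and the primers
above can be checked with a few lines of code. The sketch below is illustrative: the helper
names are hypothetical, and the three template excerpts are short stretches of SEQ ID NO: 1
as read from this document's sequence listing (an assumption, since the listing is
OCR-derived). Lowercase primer bases are treated as the 5' tail carrying the cloning site and
uppercase bases as the annealing portion.

```python
RESTRICTION_SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "SalI": "GTCGAC"}

PRIMERS = {
    "SEQ ID NO: 12": "ccggaattcTATAAAGATCAACAAAAAC",  # sense, residues 64-313
    "SEQ ID NO: 13": "cgcggatccTCAATTGTTTCTGAAATT",   # antisense, stop codon
    "SEQ ID NO: 14": "ccggaattcGCAGCAGATGATATTTGT",   # sense, residues 150-313
}

# ASSUMED excerpts of SEQ ID NO: 1 (1-based, inclusive coordinates).
TEMPLATE_190_208 = "tataaagatcaacaaaaac"
TEMPLATE_448_465 = "gcagcagatgatatttgt"
TEMPLATE_925_942 = "aatttcagaaacaattga"  # 3' end of the ORF, ends in the tga stop


def aa_range_to_nt(first_aa, last_aa, include_stop=False):
    """Map a residue range to 1-based nucleotide coordinates in the ORF."""
    start = 3 * (first_aa - 1) + 1
    end = 3 * last_aa + (3 if include_stop else 0)
    return start, end


def reverse_complement(dna):
    """Reverse complement of a DNA string, preserving letter case."""
    return dna.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def split_primer(primer):
    """Split a primer into its lowercase 5' tail and uppercase annealing part."""
    n = 0
    while n < len(primer) and primer[n].islower():
        n += 1
    return primer[:n], primer[n:]


def sites_in(tail):
    """Names of the restriction sites found in a primer tail."""
    return [name for name, site in RESTRICTION_SITES.items() if site in tail.upper()]


if __name__ == "__main__":
    print("residues 64-313 plus stop:", aa_range_to_nt(64, 313, include_stop=True))
    print("residues 150-313 plus stop:", aa_range_to_nt(150, 313, include_stop=True))
    for name, primer in PRIMERS.items():
        tail, anneal = split_primer(primer)
        print(name, "site:", sites_in(tail), "anneals:", anneal)
```

The antisense primer is written 5' to 3' on the opposite strand, so it is its reverse
complement that should match the 3' end of the open reading frame.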

As shown in Fig. 16, different combinations of bait and prey vectors (as indicated in NO 1
to 6) were introduced into AH109 yeast cells. Portions of DnaI extending from amino acid
residues 64 to 313 (herein referred to as SEQ ID NO: 18) as well as from 150 to 313 (herein
referred to as SEQ ID NO: 16) were both found to interact with bacteriophage 77ORF104 since
the introduction of appropriate plasmids into host yeast cells resulted in their growth on THAL
minus SD medium (NO 1 and 3). Induction of these reporter genes is dependent upon the
interaction between DnaI-related polypeptides and 77ORF104 since the introduction of control
plasmids expressing non-interacting protein partners (pGBKLam: NO 2 and 4 or
77pGADORF13: NO 6) did not result in reporter gene expression (Fig. 16A).



OTHER EMBODIMENTS

Other embodiments are within the following claims. 
CLAIMS

1. A method of identifying a compound that is active on a polypeptide comprising the
amino acid sequence of SEQ ID NO: 16, said method comprising

contacting a candidate compound with said polypeptide, and
detecting binding of said candidate compound to said polypeptide, wherein detection of
binding is indicative that said compound is active on said polypeptide.

2. The method of claim 1 wherein said detecting comprises the step of measuring the
binding of a candidate compound, wherein the compound is directly or indirectly detectably
labeled, to said polypeptide.

3. The method of claim 1 wherein said detecting comprises measurement by phage display.

4. The method of claim 1 wherein said detecting comprises measurement by surface
plasmon resonance.

5. The method of claim 1 wherein said detecting comprises measurement by FRET.

6. The method of claim 1 wherein said detecting comprises measurement of fluorescence
polarization changes.

7. The method of claim 1 wherein said detecting comprises a scintillation proximity assay.

8. The method of claim 1 wherein said detecting comprises a biosensor assay.

9. The method of claim 1 wherein said compound is selected from the group consisting of a
small molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage
inhibitor protein.

10. The method of claim 1 wherein said active compound is a peptide synthesized by a
recombinant expression system and purified, or artificially synthesized.

11. A method of identifying a compound that is active on a polypeptide comprising the
amino acid sequence of SEQ ID NO: 16, said method comprising the step of:
contacting a first and a second polypeptide in the presence and absence of a candidate
compound, wherein said first polypeptide comprises the amino acid sequence of SEQ ID NO: 16
or a fragment or variant thereof that specifically binds phage 77ORF104 and said second
polypeptide comprises phage 77ORF104 or a domain thereof that specifically binds a
polypeptide of SEQ ID NO: 16, and detecting the binding of said first and said second
polypeptides to each other, wherein a decrease in the binding of said first and said second
polypeptides in the presence of said candidate compound relative to the binding in the absence of
said candidate compound identifies said candidate compound as a compound that is active on a
polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

12. The method of claim 11 wherein said first or said second polypeptide is directly or
indirectly detectably labeled.

13. The method of claim 11 wherein said detecting comprises measurement by phage
display.

14. The method of claim 11 wherein said detecting comprises measurement by surface
plasmon resonance.

15. The method of claim 11 wherein said detecting comprises measurement by FRET.

16. The method of claim 11 wherein said detecting comprises measurement of fluorescence
polarization changes.

17. The method of claim 11 wherein said detecting comprises a scintillation proximity assay.

18. The method of claim 11 wherein said detecting comprises a biosensor assay.

19. The method of claim 11 wherein said candidate compound is selected from the group
consisting of a small molecule, a peptidomimetic compound, and a fragment or derivative of a
bacteriophage inhibitor protein.

20. The method of claim 11 wherein said candidate compound is a peptide synthesized by
expression systems and purified, or artificially synthesized.
21. An agonist or an antagonist of the activity of a DnaI polypeptide or a gene encoding said
polypeptide.

22. A method of identifying a compound that is active on a DnaI polypeptide, said method
comprising the steps of:

contacting a candidate compound with cells expressing a polypeptide comprising SEQ
ID NO: 16, and

detecting DnaI activity in said cells, wherein a decrease in activity relative to DnaI
activity in cells not contacted with said candidate compound is indicative that said candidate
compound is active on a DnaI polypeptide.

23. A method of making an antibacterial compound, comprising the steps of:
determining whether a candidate compound is active on a polypeptide comprising the
amino acid sequence of SEQ ID NO: 16 or a gene encoding said polypeptide; and

synthesizing or purifying said candidate compound in an amount sufficient to provide a
therapeutic effect when administered to an organism infected by a bacterium naturally producing
a polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

24. The method of claim 23 wherein the antibacterial compound is selected from the group
consisting of a small molecule, a peptidomimetic compound, and a fragment or derivative of a
bacteriophage inhibitor protein.

25. The method of claim 23 wherein the antibacterial compound is a peptide synthesized by
expression systems and purified, or artificially synthesized.

26. A method for inhibiting a bacterium, comprising contacting said bacterium with a
compound active on a polypeptide comprising the amino acid sequence of SEQ ID NO: 16 or a
gene encoding said polypeptide.

27. The method of claim 26 wherein said contacting is performed in vitro.

28. The method of claim 26 wherein said contacting is performed in vivo in an animal.



WO 01/46383                                                            PCT/US00/35180



29. The method of claim 26 wherein said compound is selected from the group consisting of
a small molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage
inhibitor protein.

30. The method of claim 26 wherein said compound is a peptide synthesized by a
recombinant expression system and purified, or artificially synthesized.

31. A method for treating a bacterial infection in an animal suffering from an infection,
comprising administering to the animal a therapeutically effective amount of a compound active
on a polypeptide comprising the amino acid sequence of SEQ ID NO: 16 or a gene encoding the
polypeptide.

32. The method of claim 31 wherein said compound is selected from the group consisting of
a small molecule, a peptidomimetic compound, and a fragment or derivative of a bacteriophage
inhibitor protein.

33. The method of claim 31 wherein said compound is a peptide synthesized by expression
systems and purified, or artificially synthesized.

34. A method of prophylactic treatment to prevent bacterial infection comprising contacting
an indwelling device with a compound active on a polypeptide comprising the amino acid
sequence of SEQ ID NO: 16 before its implantation into a mammal, such contacting being
sufficient to prevent S. aureus infection at the site of implantation.

35. A method of prophylactic treatment to prevent infection of an animal by a bacterium
comprising administering to said animal a compound that is active on an S. aureus DnaI
polypeptide comprising the amino acid sequence of SEQ ID NO: 16 or a gene encoding the
polypeptide in an amount sufficient to reduce adhesion of the bacterium to a tissue surface of
said animal.

36. A method of diagnosing in an individual an infection with Staphylococcus aureus,
comprising:
determining the presence in the individual of a polypeptide comprising the amino acid
sequence of SEQ ID NO: 16, wherein the presence of said polypeptide is diagnostic for S. aureus
infection.

37. The method of claim 36 wherein said determining step comprises contacting a biological
sample from said individual with an antibody specific for an epitope present on a polypeptide
comprising the amino acid sequence of SEQ ID NO: 16.

38. A method of diagnosing in an individual an infection with Staphylococcus aureus,
comprising

determining the presence in said individual of a nucleic acid sequence encoding a
polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

39. The method of claim 38 wherein said determining step comprises contacting a nucleic
acid sample of said individual with an isolated, purified or enriched nucleic acid probe of at least
15 nucleotides in length that hybridizes under stringent hybridization conditions with the
sequence of SEQ ID NO: 1, or the complement of such probe.

40. An isolated, purified or enriched polynucleotide comprising a polynucleotide sequence
that has at least 55% identity to the sequence of SEQ ID NO: 1, or the complement of said
polynucleotide sequence.

41. An isolated, purified or enriched polynucleotide comprising a sequence encoding the amino
acid sequence of SEQ ID NO: 16, or the complement of said polynucleotide.

42. An isolated, purified or enriched polynucleotide comprising SEQ ID NO: 17 or the
complement of said polynucleotide sequence.

43. An isolated, purified or enriched polynucleotide consisting of the sequence of SEQ ID
NO: 17.

44. An isolated, purified or enriched polypeptide having at least 55% identity to the amino
acid sequence of SEQ ID NO: 16.

45. An isolated, purified or enriched polypeptide of at least 50 amino acids in length having
at least 50% identity to the amino acid sequence of SEQ ID NO: 16.

46. An isolated, purified or enriched polypeptide having at least 70% similarity to the amino
acid sequence of SEQ ID NO: 16.

47. An isolated, purified or enriched polypeptide of at least 20 amino acids in length having
at least 60% similarity to the amino acid sequence of SEQ ID NO: 16.

48. An isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 16.

49. An isolated polypeptide consisting of the amino acid sequence of SEQ ID NO: 16.

50. An isolated, purified or enriched antibody specific for a polypeptide of SEQ ID NO: 16.

51. A composition comprising a bacteriophage 77ORF104 polypeptide and a polypeptide
comprising the amino acid sequence of SEQ ID NO: 16 or a variant thereof that specifically
binds phage 77ORF104 polypeptide.

52. A composition comprising a nucleic acid encoding bacteriophage 77ORF104 and a
nucleic acid comprising SEQ ID NO: 17.
Table 1: Similarities in sequence between the dnaI homolog and
sequences deposited in public sequence databases.

Sequences producing significant alignments:

gtU40<l25UpJP06S61tOHAl BACSU mKOSOHM. gROTCIK DNAt >git2797,. 
gi 1 463269 mi5l83) ORFV Tsacillua aubciliaj 

gx 12072367 1 emb|CAA70453 1 (V0S2S5I primoaomal protein Dnal (Baci.. 
gx(530419|embtCAA83732t (Z330S8) GXP bind. CD4a/FASl /S£C1B fan.* 
gil23a3431 tAS0007X3) DMA- replication protein OnaC (AquifeK aeo.. 
giUX76732lsplP459l0tYQAM_BACSO HYPOTHETICAL 36.1 KO PROTEIM IN.. 
gi|2l27076|pirl II40411 nypothetical protein 5 (xrc region) - Ba.. 
gxi:72286lUpie39732(XKDC_BACS0 PHACS-UKE ELEMENT P3SX PROTSIN-. 
gH 1353523 tU3a306> ORm"'lBacteciephage tit) 
git23830QQ [A£0006a3] chroiaoaome replication initiator protein 

>giJl40025lsp|P06S67iDNAl^BACSa PRXMOSOMAL PROTEIN DNAI 

>gil27970eipirTlIQ3S44 dUiaJV protein homolog, 44K - 
Bacillus subtiUa >gil 39681 leint)tCAA28633t 1X04363) ORF 
311 (AA 1-3111 taacillus auUtilisJ 

>gill7Gd996|eita7lCAA996a5l (Z7S206) replication protein 
(Bacillus subtilis} >giUZ932Bl (AFOOSZZO) Dnal 
[BaciUua aubtilis] >gi|263S363|em»)CAB1485ai (Z99118) 
hftlicase loader {Bacillus subtilisl 
Length = 311

Score = 231 bits (583), Expect = 5e-60

Identities = 120/280 (42%), Positives = 177/280 (62%), Gaps = 2/280 (0%)

Query: 35 DPDVKQFLEAKRkELTNAMIOSDLNVtOEVKDQQKHYOG-W^rADCPNFVKGHVPELWD 93 

D DV-r- FX* MI— LN L EY *-0 * •♦•C N ++G+ P+L V* 

Sb^ct: 3: DQDVOAFLKENEEn/IDaKMISKSLNXmYrEOSKKCSYCSEDENCWNLLEGYHPKLWN 90 

Cuery: 94 NKaiKIRYLCC?CKIKYDS£Rif:AELITSHHMORDrtNAKLKOiyMNHRDM.DVAt^ 153 

Sbjct: 91 GaSIDXEYYECPVKRKLDQOKKGQSL«KSMYrOQDI.LGATFaQVniSDPSRLAMFOHVTD .150 

Query; 154 ICTAITNGEOVKGLYLYG?rGTGKSrrtGAtw;aUaKKVRSTIlYtPErrRrLtCGG?KO 213 

-r * KGLYLYG rC GK-r+L AIAN+L K* S I+Y+PEr^-R LK *D 

Sb;ct: 151 rLKSYMErGKGKGLYLVGKFGVGKTrWiAArANEUEKSYSSMIVYVPEFVRELKNSLQO 210 

Cuery: 214 qgreKKtHRVR£ANILM:.DDIGnE5VTPttVRDE,VIGPLLHYRMVHEI.PTFrsSNrDYSEI. 273 

* EfKL* 4.LML0DICAE +T WVRDEVIG +L +RM +I.PTFrSSNr EI* 

Sbjcr: 211 QTLEEIONMVKTTPVLMlDDrGAESKTSWVRDEVIGTVtOHRMSQatPTFrSSNrSPOEr. 270 

Query: 274 EHKLMrrRDGE-SKTKAARIIERVKSLST?Vn.SGEMFBM 312 

- -►HK Gt E- KAAR+^ER-*- L* .P L GEM 

Sbjct: 271 KHKFrYSQRGEKEEVKAARLMERlLYLAAPlRLOGENRRH 310 

>gii4«B2€3 {K1S1831 GREY (Bacillus subtilisi 
Length = 207

Score = 125 bits (310), Expect = 5e-29

Identities = 67/190 (35%), Positives = 105/190 (53%), Gaps = 1/190 (0%)

Query: 20 OF5tXXXXXXXXXXXNDP0VKQFLEAKRAELTRAKlDEOLNVLOEYKDQQKHYDG-HKFAD 78 

DF D 0^^ FL* *• ♦ MI^-^ LN L EY +Q A+ 

SbjCt: 15 DFQNRLEQTKEKVMKDCSVOAFLKENESVlDQKKrEKSLMKLYEYIEQSKNCSYCSEDEN 75 

Query: 79 CPNFVKGHVPELYVDNMRIKIRYLQCPCKIKrDEERrEAELITSHHMQRDTLNAKLKDIY 13B 

C N '»-+G'^ P+L V- I t Y -rCP K K D-*-*-^ ▼ t+ S *+Q+D LA ♦ ♦ 
Sbjct: 75 CNNLLEGYHPKLVVNGRSIDrS?yEC?VKRKLDQ0KKQQSLMKSMYICQOLLGATFQQVD 135 



Query: 139 MNHRORLOVAttAADDrCTAITWGEQVKQLYLTfGPrGTGKSFILGAIANQLKSKXVRSTri 198 

RL + D + T XGLYLVG FG GK+F+L AlAN+L K+ S 
SbjCt: 135 iSDPSRLAMFQHVTDFLKSyNETGKGKGLYLYGXFGVGKTFMLAAIANELAEKEYSSMrV I95 

Query: 199 YLPEFIRTLK 208 

Y+PEF+R LK 
Sbjct; 195 YVPEFVRELK 205 
SEQ ID NO:1

  1 atgggaggag gacagtcaat aatgaagcaa tttaaaagta taattaacac gtcgcaggac
 61 tttgaaaaaa gaatagaaaa gataaaaaaa gaagtaatca atgacccaga tgttaagcaa
121 tttttggaag cgcatcgagc tgaattaacg aatgctatga ttgatgaaga cttaaatgtg
181 ttacaagagt ataaagatca acaaaaacat tatgacggtc ataaatttgc tgattgtcca
241 aatttcgtaa aggggcatgt gcctgagtta tatgttgata ataaccgaat taaaatacgc
301 tatttacaat gcccatgtaa aatcaagtac gacgaagaac gctttgaagc tgagctaatt
361 acatctcatc atatgcaacg agatacttta aatgccaaat tgaaagatat ttatatgaat
421 catcgagacc gtcttgatgt agctatggca gcagatgata tttgtacagc aataactaat
481 ggggaacaag tgaaaggcct ttacctttat ggtccatttg ggacaggtaa atcttttatt
541 ctaggtgcaa ttgcgaatca gctcaaatct aagaaggtac gttcgacaat tatttattta
601 ccggaattta ttagaacatt aaaaggtggc tttaaagatg gttcttttga aaagaaatta
661 catcgcgtaa gagaagcaaa cattttaatg cttgatgata ttggggctga agaagtgact
721 ccatgggtga gagatgaggt aattggacct ttgctacatt atcgaatggt tcatgaatta
781 ccaacattct ttagttctaa ttttgactat agtgaattgg aacatcattt agcgatgact
841 cgtgatggtg aagagaagac taaagcagca cgtattattg aacgtgtcaa atctttgtca
901 acaccatact ttttatcagg agaaaatttc agaaacaatt ga



SEQ ID NO:2 

1 MGGGQSIMKQ FKSIINTSQD FEKRIEKIKK EVINDPDVKQ FLEAHRAELT NAMIDEDLNV 

61 LQEYKDQQKH YDGHKFADCP NFVKGHVPEL YVDNNRIKIR YLQCPCKIKY DEERFEAELI 

121 TSHHMQRDTL NAKLKDIYMN HRDRLDVAMA ADDICTAITN GEQVKGLYLY GPFGTGKSFI 

181 LGAIANQLKS KKVRSTIIYL PEFIRTLKGG FKDGSFEKKL HRVREANILM LDDIGAEEVT 

241 PWVRDEVIGP LLHYRMVHEL PTFFSSNFDY SELEHHLAMT RDGEEKTKAA RIIERVKSLS 

301 TPYFLSGENF RNN 
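
The amino acid sequence of SEQ ID NO:2 corresponds to a translation of the nucleotide sequence of SEQ ID NO:1, and the relationship can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and a listing in which position labels and spaces separate the nucleotides; the helper names `parse_listing` and `translate` are hypothetical.

```python
# Standard genetic code, built from the conventional t/c/a/g codon ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_listing(text):
    """Keep only a/c/g/t, dropping position labels, spaces and line breaks.

    Any other character is silently discarded, so a damaged listing can
    shift the reading frame; clean the input before relying on the result.
    """
    return "".join(ch for ch in text.lower() if ch in "acgt")


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 nucleotides of SEQ ID NO:1.
first_row = "1 atgggaggag gacagtcaat aatgaagcaa tttaaaagta taattaacac gtcgcaggac"
print(translate(parse_listing(first_row)))  # prints "MGGGQSIMKQFKSIINTSQD"
```

Passing the full 942-nucleotide listing through the same two functions gives the 313-residue sequence, with translation ending at the terminal tga codon.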
SEQ ID NO:3 

Complete genome sequence of bacteriophage 77 

1 gatcaaaata cttggggaac ggttagggag taaacttcgc gataatttta aaaattcatg 

61 tataaccccc ctcttataac cattttaagg caggtgatga aatggagatt atagtcgatg 

121 aaaatttagt gcttaaagaa aaagaaaggc tacaagtatt atataaagac atacctagca 

181 ataaattaaa agtagttgat ggtttaatta ttcaagcagc aaggctacgt gtaatgcttg 

241 attacatgtg ggaagacata aaagaaaaag gtgattatga tttatttact caatctgaaa 

301 aggcgccacc atatgaaagg gaaagaccag tagccaaact atttaatgct agagatgctg 

361 catatcaaaa aataatcaaa caattatcgg atttattgcc cgaagagaaa gaagacacag 

421 aaacgccatc tgatgattac ctatgattag taataaatac gttgatgaat atataaattt 

481 gtggaaacaa ggaaagataa ttttaaataa agaaagaatt gatctcttta attatctaca 

541 aaaacatata tatfccacgag atgatgtata ttttgatgaa cagaaaatcg aggattgtat 

601 caaatttatt gaaaaatggt attttccaac attaccattt caaaggttta tcatagctaa 

661 tatatttctt atagataaaa atacagatga agctttcttt acagaatttg ctattttcat 

721 gggacgtgga ggcgggaaaa acggtctaat aagtgctatt agtgattttc tttctacgcc 

781 cttacacgga gttaaagaat atcacatctc cattgttgct aatagtgaag atcaagcaaa 

841 aacatcgttt gatgaaatca gaaccgtttt aatggataac aaacgaaata agacgggtaa 

901 aacgccaaaa gctccttatg aagttagtaa agcaaaaata ataaaccgtg caactaaatc 

961 ggttattcga tataacacat caaacacaaa aaccaaagac ggtggacgtg aggggtgtgt 

1021 tatttttgat gaaattcatt atttctttgg tcctgaaatg gtaaacgtca aacgtggtgg 

1081 attaggtaaa aagaaaaata gaagaacgtt ttatataagt actgatggtt ttgttagaga 

1141 gggttatatc gatgcaatga agcacaaaat tgcaagtgta ttaagtggca aggttaaaaa 

1201 tagtagattg tttgcttttt attgtaagtt agacgatcca aaagaagttg atgacagaca 

1261 gacgtgggaa aaggcgaacc caatgttaca taaaccgtta tcagaatacg ctaaaacact 

1321 gctaagcacg atfcgaagaag aatataacga tttaccattc aaccgttcaa ataagcccga 

1381 attcatgact aagcgaatga atttgcctga agttgacctt gaaaaagtaa tagcaccatg 

1441 gaaagaaata ctagcgacta atagagagat accaaattta gataatcaaa tgtgtattgg 

1501 tggtttagac tttgcaaaca ttcgagattt tgcaagtgta gggctattat tccgaaaaaa 

1561 cgatgattac atttggttag gacattcgtt tgtaagacaa gggtttttgg atgatgtcaa 

1621 attagaacct cctattaaag aatgggaaaa aatgggatfca ttgaccattg tcgatgatga 

1681 tgtcattgaa attgaatata tagttgattg gtttttaaag gctagagaaa aatatgggct 

1741 tgaaaaagtc atagctgata attatagaac tgatattgta agacgtgcgt ttgaggatgc 

1801 tggcataaaa cttgaagtac ttagaaatcc aaaagcaata catggattac ttgcaccacg 

1861 tatcgataca atgtttgcga aacataacgt aafcatatgga gacaatcctt tgatgcgttg 

1921 gtttactaat aatgttgctg taaaaatcaa gccggatgga aataaagagt atatcaaaaa 

1981 agatgaagtc agacgtaaaa cggatggatt catggctttt gttcacgcat tatatagagc 

2041 agacgatata gtagacaaag acatgtctaa agcgcttgat gcattaatga gtatagattt 

2101 ctaatagagg aggtgagaca tgagtattct agaaaagata tttaaaacta ggaaagatat 

2161 aacatatatg cttgatttag atatgataga agatctatca caacaagcgt atgtgaaacg 

2221 tttagcgatt gatagttgta ttgaatttgt tgcgcgagct gtcgctcaaa gtcattttaa 

2281 agtattggaa ggtaatagaa ttcaaaagaa tgatgtttac tacaagttaa atataaaacc 

2341 aaatactgac ttatcaagcg atagtttttg gcaacaagtt atatataaac taatttatga 

2401 taacgaggtt ttaatcgtag taagtgacag caaagaatta .cttatcgcag atagctttta 

2461 cagagaagag tacgctttgt atgatgatafc attcaaagat gtaacggtta aagattatac 

2521 ttatcaacgt actttcacaa tgcaagaggt catatattta aagtacaaca acaataaagt 

2581 gacacacttt gtagaaagtc tattcgaaga ttacgggaaa atattcggaa gaatgatagg 

2641 tgcacaatta aaaaactatc aaataagagg gattttgaaa tctgcctcta gcgcatatga 

2701 cgaaaagaat atagaaaaat tacaagcgtt cacaaataaa ttattcaata cttttaataa 

2761 aaatcaacta gcaatcgcgc ctttgataga aggttttgat tatgaggaat tatctaatgg 

2821 tggtaagaat agtaacatgc ctttttctga attgagtgag ctaatgagag atgcaataaa 

2881 aaatgttgcg ttgafcgattg gtatacctcc aggtttgatt tacggagaaa cagctgattt 

2941 ggaaaaaaac acgcfctgtat ttgagaagtt ctgtttaaca cctttattaa aaaagattca 

3001 gaacgaatta aacgcgaaac tcataacaca aagcatgtat ttgaaagata caagaataga 

3061 aattgtcggt gtgaataaaa aagacccact tcaatatgct gaagcaattg acaaacttgt 

3121 aagttctggt tcatttacaa ggaatgaggt gcggattatg ttaggtgaag aaccatcaga 

3181 caatcctgaa ttagacgaat acctgattac taaaaactac gaaaaagcta acagtggtga 
3241 aaatgatgaa aaagaaaaag atgaaaacac tttgaaaggt ggtgatgaag atgaaagcgg 

3301 agattaaagg cgtcatcgtt tccaacgaag ataaatgggt ttacgaaatg cttggtatgg 

3361 attcgacttg tcctaaagat gttttaacac aactagaatt tagtgatgaa gatgttgata 

3421 ttataattaa ctcaaatggt ggtaacctag tagctggtag tgaaatatat acacatttaa 

3481 gagctcataa aggcaaagtg aatgttcgta tcacagcaat agcagcaagt gcggcatcgc 

3541 ttatcgcaat ggctggtgac cacatcgaaa tgagtccggt tgctagaatg atgattcaca 

3601 atccttcaag tattgcgcaa ggagaagtga aagatctaaa tcatgctgca gaaacattag 

3661 aacatgttgg tcaaataatg gctgaggcat atgcggttag agctggtaaa aacaaacaag 

3721 aacttataga aatgatggct aaggaaacgt ggctaaatgc tgatgaagcc attgaacaag 

3781 gttttgcgga tagtaaaatg tttgaaaacg acaatatgca aattgtagca agcgatacac 

3841 aagtgttatc gaaagatgta ttaaatcgtg taacagcttt ggtaagtaaa acgccagagg 

3901 ttaacattga tattgacgca atagcaaata aagtaattga aaaaataaat atgaaagaaa 

3961 aggaatcaga aatcgatgtt gcagatagta aattatcagc aaatggattt tcaagattcc 

4021 ttttttaata caaaaatagg aggtcataaa atgactataa atttatcgga aacafctcgca 

4081 aafcgcgaaaa acgaatttat taatgcagta aacaacggtg aaccgcaaga aagacaaaat 

4141 gaattgtacg gtgacatgat taaccaacta tfctgaagaaa ctaaattaca agcaaaagca 

4201 gaagctgaaa gagtttctag tttacctaaa tcagcacaaa ctttgagtgc aaaccaaaga 

4261 aatttcttta tggatatcaa taagagtgct ggatataaag aagaaaaact tttaccagaa 

4321 gaaacaattg atagaatctt cgaagattta acaacgaatc atccattatt agctgactta 

4381 ggtattaaaa atgctggttt gcgtttgaag ttcttaaaat ccgaaacttc tggcgtggct 

4441 gtttggggta aaatctatgg tgaaattaaa ggtcaattag atgctgcgtt cagtgaagaa 

4501 acagcaattc aaaataaatt gacagcgttt gttgttttac caaaagattt aaatgatttt 

4561 ggtcctgcgt ggattgaaag atttgttcgt gttcaaatcg aagaagcatt tgcagtggcg 

4621 cttgaaactg cgttcttaaa aggtactggt aaagaccaac cgattggctt aaaccgtcaa 

4681 gtacaaaaag gtgtatcggt aactgatggt gcttatccag agaaagaaga acaaggtacg 

4741 cttacatttg ctaatccgcg cgctacggtt aatgaattga cgcaagtgtt taaataccac 

4801 tcaactaacg agaaaggtaa atcagtagcg gttaaaggta atgtaacaat ggttgttaat 

4861 ccgtccgatg cttttgaggt tcaagcacag tatacacatt taaatgcaaa tggcgtatat 

4921 gttactgctt taccatttaa tttgaatgtt attgagtcta cagttcaaga agcaggtaag 

4981 gttttaacgt acgttaaagg tctatatgat ggttatttag ctggtggtat taatgttcag 

5041 aaatttaaag aaacacttgc gttagatgat atggatttat acactgcaaa acaatttgct 

5101 tacggcaaag cgaaagataa taaagttgct gctgtttgga aattagattt aaaaggacat 

5161 aaaccagctt tagaagatac cgaagaaaca ctataaaatt ttatgaggtg ataaaatggt 

5221 gaaatttaaa gttgttagag aatttaaaga catagagcac aatcaacaca agtacaaagt 

5281 aggggagttg tatccagctg aagggtataa caatcctcgt gttgaattgt tgacaaatca 

5341 aatcaaaaat aagtacgaca aagtttatat cgtaccttta gataagctga caaaacaaga 

5401 attattagaa ctatgcgaat cattacaaaa aaaagcgtct agttcaatgg ttaaaagtga 

5461 aatcatcgac ttattgaatg gtgaagacaa tgacgattga tgatttgctt gtcaaattta 

5521 aatcacttga aaagattgac cataattcag aggatgagta cttaaagcag ttgttaaaaa 

5581 tgtcgtacga gcgtataaaa aatcagtgcg gagtttttga attagagaat ttaataggtc 

5641 aagaattgat acttatacgc gctagatatg cttatcaaga tttattagaa cacttcaacg 

5701 acaattacag acctgaaata atagattttt cgttafcctct aatggaggta tcagaagatg 

5761 aagaaagtgt ttaagaaacc tagaattaca actaaacgtt taaatacgcg tgttcatttt 

5821 tataagtata ctgaaaataa tggtccagaa gctggagaaa aagaagaaaa attattatat 

5881 agctgttggg cgagtattga tggtgtctgg ttacgtgaat tagaacaagc tatctcaaac 

5941 ggaacgcaaa atgacattaa attgtatatt cgtgatccgc aaggtgatta tttacccagt 

6001 gaagaacatfc atcttgaaat tgaatcaaga tatttcaaaa atcgtttgaa tataaagcaa 

6061 gtatcaccag atttggataa taaagacttt attatgattc gcggaggata tagttcatga 

6121 gtgtgaaagt gacaggtgat aaagcattag aaagagaatt agaaaaacat tttggcataa 

6181 aagagatggt aaaagttcaa gataaggcgt taatagctgg tgctaaggta attgttgaag 

6241 aaataaaaaa acaactcaaa ccttcagaag actcaggagc actgattagt gagattggtc 

6301 gtactgaacc tgaatggata aaggggaaac gtactgttac aattaggtgg cgtgggcctt 

6361 ttgaacgatt tagaatagta catttaattg aaaatggtca tgttgagaaa aagtcaggaa 

6421 aatfctgtaaa acctaaagct atgggtggga ttaatagagc aataagacaa gggcaaaata 

6481 agtattttga gacgcfcaaaa agggagttga aaaaattgtg attgatatfct tgtacaaagt 

6541 tcatgaagtg attagtcaag acagaattat tagagagcac gtaaafcatca ataatattaa 

6601 gttcaataaa taccctaatg taaaagatac tgatgtacct tttattgtta ttgacgatat 
6661 cgacgaccca atacctacaa cttatactga cggagatgag tgtgcatata gttatattgt 
6721 ccaaatagat gtttttgtta agtacaatga tgaatataat gcgagaatca taagaaataa 
6781 gatatctaat cgcattcaaa agttattatg gtctgaacta aaaatgggaa atgtttcaaa 
6841 tggaaaaccg gaatatatag aagaatttaa aacatataga agctctcgcg tttacgaggg 
6901 cattttttat aaggaggaaa attaaatggc agtaaaacat gcaagtgcgc caaaggcgta 
6961 tattaacatt actggtttag gtttcgctaa attaacgaaa gaaggcgcgg aattaaaata 
7021 tagtgatatt acaaaaacaa gaggattaca aaaaattggt gttgaaactg gtggagaact 
7081 aaaaacagct tatgctgatg gcggtccaat tgaatcaggg aatacagacg gagaaggtaa 
7141 aatctcatta caaatgcatg cgttccctaa agagattcgc aaaatcgttt ttaatgaaga 
7201 ttatgatgaa gatggcgttt acgaagagaa acaaggtaaa caaaacaatt acgtagctgt 
7261 atggttcaga caagagcgta aagacggtac atttagaaca gttttattac ctaaagttat 
7321 gtttacaaat cctaaaatcg atggagaaac ggctgagaaa gattgggatt tctcaagtga 
7381 agaggttgaa ggtgaggcac ttttcccttt agttgataat aaaaagtcag tacgtaagta 
7441 tatctttgat tcagctaaca tgacaaatca tgatggagac ggtgaaaaag gcgaagaggc 
7501 tttcttaaag aaaattttag gcgaagaata tactggaaac gtgacagagg gcaacgaaga 
7561 aactttgtaa caaaaccggc ttcatcggaa actgcggtaa agtcggttaa tataccagat 
7621 agcattaaaa cacttaaagt tggcgacaca tacgatttaa atgttgtagt agagccatct 
7681 aatcaaagta agttattgaa atacacaaca gatcaaacga atattgtatc aatcaatagt 
7741 gatggtcaag ttactgcgga agcacaaggc attgctacgg ttaaagcaac agttggtaat 
7801 atgagtgaca ctataacaat aaatgtagaa gcataagagg gggcaacccc tctattttat 
7861 ttgaaaataa ggagagtatf ataaaatggc aaaattaaaa cgtaacatta ttcaattagt 
7921 agaagatcca aaagcaaatg aaattaaatt acaaacgtac ttaacaccac acttcatttc 
7981 atttgaaatt gtatacgaag caatggattt aatcgatgat attgaggacg aaaatagcac 
8041 gatgaagcca agagaaatcg ctgacagatt gatggatatg gttgtaaaaa tttacgataa 
8101 ccaattcaca gttaaagacc taaaagaacg tatgcatgca cctgatggaa tgaatgcact 
8161 tcgtgaacaa gtgattttca ttactcaagg tcaacaaact gaggaaacta gaaattttat 
8221 ccagaacatg aaataaagcc tgaagattta acatataaag caatgttgaa aaatatggat 
8281 actctcatga tggacttaat tgaaaatggt aaagacgcta acgaagtttt aaaaatgcca 
8341 tttcattatg tgctttccat atatcaaaat aaaaataatg acatttctga agaaaaagca 
8401 gaggctttaa ttgatgcatt ttaaccttaa ccgtttggtt agggttattt ttttgaactt 
8461 ttttagaaag gaggtaaaaa atgggagaaa gaataaaagg tttatctata ggtttggatt 
8521 tagatgcagc aaatttaaat agatcatttg cagaaatcaa acgaaacttt aaaactbtaa 
8581 attctgactt aaaattaaca ggcaacaact tcaaatatac cgaaaaatca actgatagtt 
8641 acaaacaaag gattaaagaa cttgatggaa ctatcacagg ttataagaaa aacgttgatg 
8701 atttagccaa gcaatatgac aaggtatctc aagaacaggg cgaaaacagt gcagaagctc 
8761 aaaagttacg acaagaatat aacaaacaag caaatgagct gaattattta gaaagagaat 
8821 tacaaaaaac atcagccgaa tttgaagagt tcaaaaaagc tcaagttgaa gctcaaagaa 
8881 tggcagaaag tggctgggga aaaaccagta aagtttttga aagtatggga cctaaattaa 
8941 caaaaatggg tgatggttta aaatccattg gtaaaggttt gatgattggt gtaactgcac 
9001 ctgttttagg tattgcagca gcatcaggaa aagcttttgc agaagttgat aaaggtttag 
9061 atactgttac tcaagcaaca ggcgcaacag gcagtgaatt aaaaaaattg cagaactcat 
9121 ttaaagatgt ttatggcaat tttccagcag atgctgaaac tgttggtgga gttttaggag 
9181 aagttaatac aaggttaggt tttacaggta aagaacttga aaatgccaca gagtcattct 
9241 tgaaattcag tcatataaca ggttctgacg gtgtgcaagc cgtacagtta attacccgtg 
9301 caatgggcga tgcaggtatc gaagcaagtg aatatcaaag tgttttggat atggtagcaa 
9361 aagcggcgca agctagtggg ataagtgttg atacattagc tgatagtatt actaaatacg 
9421 gcgctccaat gagagctatg ggctttgaga tgaaagaatc aattgcttta ttctctcaat 
9481 gggaaaagtc aggcgttaat actgaaatag cattcagtgg tttgaaaaaa gctatatcaa 
9541 attggggtaa agctggtaaa aacccaagag aagaatttaa gaagacatta gcagaaattg 
9601 aaaagacgcc ggatatagct agcgcaacaa gtttagcgat tgaagcattt ggtgcaaagg 
9661 caggtcctga tttagcagac gctattaaag gtggtcgctt tagttatcaa gaatttttaa 
9721 aaactattga agattcccaa ggcacagtaa accaaacatt taaagattct gaaagtggct 
9781 ccgaaagatt taaagtagca atgaataaat taaaattagt aggtgctgat gtatgggctt 
9841 ctattgaaag tgcgtttgct cccgtaatgg aagaattaat caaaaagcta tctatagcgg 
9901 ttgabtggtt ttccaattta agtgatggtt ctaaaagatc aattgttatt ttcagtggta 
9961 ttgctgctgc aattggtcct gtagtttttg ggttaggtgc atttataagt acaattggca 
10021 atgcagtaac tgtattagct ccattgttag ctagtattgc aaaggctggt ggattgatta 
10081 gttttttatc gactaaagta cctatattag gaactgtctt cacagcttta actggtccaa 
10141 ttggcattgt attaggtgta ttggctggtt tagcagtcgc atttacaatt .gcttataaga 
10201 aatctgaaac atttagaaat tfctgttaatg gtgcaattga aagtgttaaa caaacattta 
10261 gtaattttat tcaatttatt caacctttcg ttgattctgt taaaaacatc tttaaacaag 
10321 cgatatcagc aatagttgat ttcgcaaaag atatttggag tcaaatcaat ggattcttta 
10381 atgaaaacgg aattfcccatt gttcaagcac ttcaaaatafc atgcaacttt attaaagcga 
10441 tatttgaatt tattttaaat tttgtaatta aaccaattat gttcgcgatt tggcaagtga 
10501 tgcaatttat ttggccggcg gttaaagcct tgattgfccag tacttgggag aacataaaag 
10561 gtgtaataca aggtgcttta aatatcatac ttggcttgat taagttcttc tcaagtttat 
10621 tcgttggtga ttggcgagga gtttgggacg ccgttgtgat gattcttaaa ggagcagttc 
10681 aattaatttg gaatttagtt caattatggt ttgtaggtaa aatacttggt gttgttaggt 
10741 actttggcgg gttgctaaaa ggattgatag caggaatttg ggacgtaata agaagtatat 
10801 tcagtaaatc tttatcagca atttggaatg caacaaaaag tatttttgga tttttattta 
10861 atagcgtaaa atcaattttc acaaatatga aaaattggtt atctaatact tggagcagta 
10921 tccgtacgaa tacaatagga aaagcgcagt cattatttsig tggcgtcaaa tcaaaattta 
10981 ctaatttatg gaatgcgacg aaagaaattt ttagtaattt aagaaattgg atgtcaaata 
11041 tttggaattc cattaaagat aatacggtag gaattgcaag ccgtttatgg agtaaggtac 
11101 gtggaatttt cacaaatatg cgcgatggct tgagttccat tatagataag attaaaagtc 
11161 atatcggcgg tatggtaagc gctattaaaa aaggacttaa taaattaatc gacggtttaa 
11221 actgggtcgg tggtaagttg ggaatggata aaatacctaa gttacacact ggtacagagc 
11281 acacacatac tactacaaga ttagttaaga acggtaagat tgcacgtgac acattcgcta 
11341 cagttgggga taagggacgc ggaaatggtc caaatggttt tagaaatgaa atgattgaat 
11401 tccctaacgg taaacgtgta atcacaccta atacagatac taccgcfctat ttacctaaag 
11461 gctcaaaagt atacaacggt gcacaaactt attcaatgtt aaacggaacg cttccaagat 
11521 ttagtttagg tactatgtgg aaagatatta aatctggtgc atcatcggca tttaactgga 
11581 caaaagataa iaataggtaaa ggtaccaaat ggcttggcga taaagttggc gatgttttag 
11641 attttatgga aaatccaggc aaacttttaa attatatact tgaagctttt ggaattgatt 
11701 tcaatfccttt aactaaaggt atgggaattg caggcgacat aacaaaagct gcatggtcta 
11761 agattaagaa aagtgctact gattggataa aagaaaattt agaagctatg ggcggtggcg 
11821 atttagtcgg cggaatatta gaccctgaca aaattaatta tcattatgga cgtaccgcag 
11881 cttataccgc tgcaactgga agaccatttc atgaaggtgt cgattttcca tttgtatacc 
11941 aagaagttag aacgccgatg ggtggcagac ttacaagaat gccatttatg tctggtggtt 
12001 atggtaatta tgtaaaaatt actagtggcg ttatcgatat gctatttgcg catttgaaaa 
12061 actttagcaa atcaccacct agtggcacga tggtaaagcc cggtgatgtt gttggtttaa 
12121 ctggtaatac cggatttagt acaggaccac atttacattt tgaaatgagg agaaatggac 
12181 gacattttga ccctgaacca tatttaagga atgctaagaa aaaaggaaga ttafccaatag 
12241 gtggtggcgg tgctacttct ggaagtggcg caacttatgc cagtcgagta atccgacaag 
12301 cgcaaagtat tttaggtggt cgttataaag gtaaatggat tcatgaccaa atgatgcgcg 
12361 ttgcaaaacg tgaaagtaac taccagtcaa atgcagtgaa taactgggat ataaatgctc 
12421 aaagaggaga cccatcaaga ggattattcc aaatcatcgg ctcaactttt agagcaaacg 
12481 ctaaacgfcgg atatactaac tttaataatc cagtacatca aggtatctca gcaatgcagt 
12541 acatfcgttag acgatatggt tggggtggtt ttaaacgtgc tggtgattac gcatatgcta 
12601 caggtggaaa agtttttgat ggttggtata acttaggtga agacggtcat ccag^atgga 
12661 ttattccaac agatccagct cgtagaaatg atgcaatgaa gattttgcat tatgcagcag 
12721 cagaagtaag agggaaaaaa gcgagtaaaa ataagcgtcc tagccaatta tcagacttaa 
12781 acgggtttga tgatcctagc ttattattga aaatgattga acaacagcaa caacaaatag 
12841 ctttattact gaaaatagca caatctaacg atgtgattgc agataaagat tatcagccga 
12901 ttattgacga atacgctttt gataaaaagg tgaacgcgtc tatagaaaag cgagaaaggc 
12961 aagaatcaac aaaagtaaag tttagaaaag gaggaattgc tattcaatga tagacactat 
13021 taaagtgaac aacaaaacaa ttccttggtt gtatgtcgaa agagggtttg aaataccctc 
13081 ttttaattat gttttaaaaa cagaaaatgt agatggacgt tcggggtcta tatataaagg 
13141 gcgtaggctt gaatcttafca gttttgatat acctttggtg gtacgtaatg actatttatc 
13201 tcacaacggc attaaaacac atgatgacgt cttgaatgaa ttagtaaagb tttttaacta 
13261 cgaggaacaa gttaaattac aattcaaatc taaagattgg tactggaacg cttatttcga 
13321 aggaccaata aagctgcaca aagaatttac aatacctgtt aagttcacta tcaaagtagt 
13381 actaacagac ccttacaaat attcagtaac aggaaataaa aatactgcga tttcagacca 
13441 agtttcagtt gtaaatagtg ggactgcfcga cactccttta attgttgaag cccgagcaat 
13501 taaaccatct agttacttta tgattactaa aaatgatgaa gattatttta tggttggtga 
13561 tgatgaggta accaaagaag ttaaggatta catgcctcct gtttatcata gtgagtttcg 
13621 tgatttcaaa ggttggacta agatgattac tgaagatatt ccaagtaatg acttaggtgg 
13681 taaggtcggc ggtgactttg tgatatccaa tcttggcgaa ggatataaag caactaattt 
13741 tcctgatgca aaaggttggg ttggtgctgg cacgaaacga gggctcccta aagcgatgac 
13801 agattttcaa attacctata aatgtattgt tgaacaaaaa ggtaaaggtg ccggaagaac 
13861 agcacaacat atttatgata gtgatggta^i gttacttgct tctattggtt atgaaaataa 
13921 atatcatgat agaaaaatag gacatattgt tgttacgttg tataaccaaa aaggagaccc 
13981 caaaaagata tacgactatc agaataaacc gataatgtat aacttggaca gaatcgttgt 
14041 ttatatgcgg ctcagaagag taggtaataa attttctatt aaaacttgga aatttgatca 
14101 cattaaagac ccagatagac gtaaacctat tgatatggat gagaaagagt ggatagatgg 
14161 cggtaagttt tatcagcgtc cagcttctat catagctgtc tatagtgcga agtataacgg 
14221 ttataagtgg atggagatga atgggttagg ttcattcaat acggagattc taccgaaacc 
14281 gaaaggcgca agggatgtca ttatacaaaa aggtgattta gtaaaaatag atatgcaagc 
14341 aaaaagtgtt gtcatcaatg aggaaccaat gfctgagcgag aaatcgtttg gaagtaatta 
14401 tttcaatgtt gattctgggt acagtgaatt aatcatacaa cctgaaaacg tctttgatac 
14461 gacggttaaa tggcaagata gatatttata gaaaggagat gagagtgtga tacatgtttt 
14521 agattttaac gacaagatta tagatttcct ttctactgat gacccttcct tagttagagc 
14581 gattcataaa cgtaatgtta atgacaattc agaaatgctt gaactgctca tatcatcaga 
14641 aagagctgaa aagttccgtg aacgacatcg tgttattata agggattcaa acaaacaatg 
14701 gcgtgaattt attattaact gggttcaaga tacgatggac ggctacacag agatagaatg 
14761 tatagcgtct tatcttgctg atataacaac agctaaaccg tatgcaccag gcaaatttga 
14821 gaaaaagaca acttcagaag cattgaaaga tgtgttgagc gatacaggtt gggaagtttc 
14881 tgaacaaacc gaatacgatg gcttacgtac tacgtcatgg acttcttatc aaactagata 
14941 tgaagtttta aagcaattat gtacaaccta taaaatggtt ttagattttt atattgagct 
15001 tagctctaat accgtcaaag gtagatatgt agtactcaaa aagaaaaaca gcttattcaa 
15061 aggtaaagaa attgaatatg gtaaagattt agtcgggtta actaggaaga ttgatatgtc 
15121 agaaatcaaa acagcattaa ttgctgtggg acctgaaaat gacaaaggga agcgtttaga 
15181 gctagttgtg acagatgacg aagcgcaaag tcaattcaac ctacctatgc gctatatttg 
15241 ggggatatat gaaccacaat cagatgatca aaatatgaat gaaacacgat taagttcttt 
15301 agccaaaaca gagttaaata aacgtaagtc ggcagttatg tcatatgaga ttacttctac 
15361 tgatttggaa gttacgtatc cgcacgagat tatatcaatt ggcgafcacag tcagagtaaa 
15421 acatagagat tttaacccgc cattgtatgt agaggcagaa gttattgctg aagaatataa 
15481 cataatttca gaaaatagca catatacatt cggtcaacct aaagagttca aagaatcaga 
15541 attacgagaa gagtttaaca agcgattgaa cataatacat caaaagttaa acgataatat 
15601 tagcaatatc aacactatag ttaaagatgt tgtagatggt gaattagaat actttgaacg 
15661 caaaatacac aaaagtgata caccgccaga aaatccagtc aatgatatgc tttggtatga 
15721 tacaagtaac cctgatgttg ctgtcttgcg tagatattgg aatggtcgat ggattgaagc 
15781 aacaccaaat gatgttgaaa aattaggtgg tataacaaga gagaaagcgc tattcagtga 
15841 attaaacaat atttttatta atttatctat acaacacgct agfcctttfcgt cagaagctac 
15901 agaattactg aatagcgagt acttagtaga taatgatttg aaagcggact tacaagcaag 
15961 tttagacgct gtgattgatg tttataatca aattaaaaat aatttagaat ctatgacacc 
16021 cgaaactgca acgattggtc ggttggtaga tacacaagct .ttatttcttg agtatagaaa 
16081 gaaattacaa gatgtttata cagatgtaga agatgtcaaa atcgccattt cagatagatt 
16141 taaattatta cagtcacaat acactgatga aaaatataaa gaagcgttgg aaataatagc 
16201 aacaaaattt ggtttaacgg tgaatgaaga tttgcagtta gtcggagaac ctaatgttgt 
16261 taaatcagct attgaagcag ctagagaatc cacaaaagaa caattacgtg actatgtaaa 
16321 aacatcggac tataaaacag acaaagacgg tattgttgaa cgtttagata ctgctgaagc 
16381 tgagagaacg actttaaaag gtgaaatcaa agataaagtt acgttaaacg aatatcgaaa 
16441 cggattggaa gaacaaaaac aatatactga tgaccagtta agtgatttgt ccaataatcc 
16501 tgagattaaa gcaagtattg aacaagcaaa tcaagaagcg caagaagctt taaaatcata 
16561 cattgafcgct caagatgatc ttaaagagaa ggaatcgcaa gcgtatgcfcg atggtaaaat 
16621 ttcggaagaa gagcaacgcg ctatacaaga tgctcaagct aaacttgaag aggcaaaaca 
16681 aaacgcagaa ctaaaggcta gaaacgctga aaagaaagct aatgcttata cagacaacaa 
16741 ggtcaaagaa agcacagatg cacagaggaa aacattgact cgctatggtt ctcaaattat: 
16801 acaaaatggt aaggaaatca aattaagaac tacfcaaagaa gagtttaatg caaccaatcg 
16861 tacactttca aatatattaa acgagattgt tcaaaatgtt acagatggaa caacaatcag 
16921 atatgatgat aacggagtgg ctcaagcttt gaatgtgggg ccacgtggta ttagattaaa 
16981 tgctgataaa attgatatta acggtaatag agaaataaac cttcttatcc aaaatatgcg 
17041 agataaagta gataaaaccg atattgtcaa cagtcttaat ttatcaagag agggtcttga 
17101 tatcaatgtt aatagaattg gaattaaagg cggtgacaat aacagatatg ttcaaataca 
17161 gaatgattct attgaactag gtggtattgt gcaacgtact tggagaggga aacgttcaac 
17221 agacgatatt tttacgcgac tgaaagacgg tcacctaaga tttagaaata acaccgctgg 
17281 cggttcact't tatatgtcac attttggtat ttcgacttat attgatggtg aaggtgaaga 
17341 cggtggttca tctggtacga ttcaatggtg ggataaaact tacagtgata gtggcatgaa 
17401 tggtataaca atcaattcct atggtggbgt cgttgcacta acgtcagata ataatcgggt 
17461 tgttctggag tcttacgctt catcgaatat caaaagcaaa caggcaccgg tgtatttata 
17521 tccaaacaca gacaaagtgc ctggattaaa ccgatttgca ttcacgctgt ctaatgcaga 
17581 taatgcttat tcgagtgacg gttatattat gtttggttct gatgagaact atgattacgg 
17641 tgcgggtatc aggttttcta aagaaagaaa taaaggtctt gttcaaattg ttaatggacg 
17701 atatgcaaca ggtggagata caacaatcga agcagggtat ggcaaattta atatgctgaa 
17761 acgacgtgat ggtaataggt atattcatat acagagtaca gacctactgt ctgtaggttc 
17821 agatgatgca ggagatagga tagcttctaa ctcaatttat agacgtactt attcggccgc 
17881 agctaatttg catattactt ctgctggcac aattgggcgt tcgacatcag cgcgtaaata 
17941 caagttatct atcgaaaatc aatataacga tagagatgaa caactggaac attcaaaagc 
18001 tattcttaac ttacctatta gaacgtggtt tgataaagct gagtctgaaa ttttagctag 
18061 agagctgaga gaagatagaa aattatcgga agacacctat aaacttgata gatacgtagg 
18121 tttgattgct gaagaggtgg agaatttagg attaaaagag tttgtcacgt atgatgacaa 
18181 aggagaaatt gaaggtatag cgtatgatcg tctatggatt catcttatcc ctgttatcaa 
18241 agaacaacaa ctaagaatca agaaattgga ggagtcaaag aatgcaggat aacaaacaag 
18301 gattacaagc taatcctgaa fcatacaattc attatttatc acaggaaatt atgaggttaa 
18361 cacaagaaaa cgcgatgtta aaagcgtata tacaagaaaa taaagaaaat caacaatgtg 
18421 ctgaggaaga gtaatcctta gcactatttt tatacaaaaa tttaaggagg tcatttaatt 
18481 atggcaaaag aaattatcaa caatacagaa aggtttattt tagtacaaat cgacaaagaa 
18541 ggtacagaac gtgtagtata tcaagatttc acaggaagtt ttacaacttc tgaaatggtt 
18601 aaccatgctc aagattttaa atctgaagaa aacgctaaga aaattgcgga gacgttaaat 
18661 ttgttatafcc aattaactaa caaaaaacaa cgtgtgaaag tagttaaaga agtagttgaa 
18721 agatcagatt tatctccaga ggtaacagtt aacactgaaa cagtatgaaa agctatgagt 
18781 tagatactca tagtctfctat tcttttagaa agcgggtgta ctgaattggg gtggttcaaa 
18841 aaacacgaac atgaatggcg catcagaagg ttagaagaga atgataaaac aatgctcagc 
18901 acactcaacg aaattaaatt aggtcaaaaa acccaagagc aagttaacat taaattagat 
18961 aaaaccttag atgctattca aaaagaaaga gaaatagatg aaaagaataa gaaagaaaat 
19021 gataagaaca tacgtgatat gaaaatgtgg gtgcttggtt tagttgggac aatatttggg 
19081 tcgctaatta tagcattatt gcgtatgctt atgggcatat aagagaggtg attaccatgt 
19141 tcggattaaa ttttggagct tcgctgtgga cgtgtttctg gtttggtaag tgtaagtaat 
19201 agttaagagt cagtgcttcg gcactggcth tttattttgg ataaaaggag caaacaaatg 
19261 gatgcaaaag taataacaag atacatcgta ttgatcttag cattagtaaa tcaattctta 
19321 gcgaacaaag gtattagccc aattccagta gacgatgaaa ctatatcatc aataatactt 
19381 actgtagtcg ctttatatac aacgtataaa gacaatccaa catctcaaga aggtaaatgg 
19441 gcaaatcaaa aattaaagaa atataaagct gaaaataagt atagaaaagc aacagggcaa 
19501 gcgccaatta aagaagtaat gacacctacg aatatgaacg acacaaatga tttagggtag 
19561 gtggttgata tatgttaatg acaaaaaatc aagcagaaaa atggtttgac aattcattag 
19621 ggaaacaatt caacccagat ggttggtatg gatttcagtg ttatgattac gccaatatgt 
19681 tctttatgtt agcgacaggc gaaaggctgc aaggtttata tgcttataat atcccgtttg 
19741 ataataaagc aaagattgaa aaatatggtc aaataattaa aaactatgac agctttttac 
19801 cgcaaaagtt ggatattgtc gttttcccgt caaagtatgg tggcggagct ggacacgttg 
19861 aaattgttga gagcgcaaat ttaaatactt tcacatcatt tggtcaaaac tggaacggta 
19921 aaggttggac taatggcgtt gcgcaacctg gttggggtcc tgaaactgtg acaagacatg 
19981 ttcattatta tgacaatcca atgtatttta ttaggttaaa cttccctaac aacttaagcg 
20041 ttggcaataa agctaaaggt attattaagc aagcgactac aaaaaaagag gcagtaatta 
20101 aacctaaaaa aattatgctt gtagccggtc atggttataa cgatcctgga gcagfcaggaa 
20161 acggaacaaa cgaacgcgat tttatacgta aatatataac gcctaatatc gctaagtatt 
20221 taagacatgc aggacatgaa gttgcattat acggtggctc aagtcaatca caagatatgt 
20281 atcaagatac tgcatacggt gttaatgtag gcaataaaaa agattatggc ttatattggg 






20341 ttaaatcaca ggggtatgac attgttctag aaatacattt agacgcagca ggagaaagcg 
20401 caagtggtgg gcatgttatt atctcaagtc aattcaatgc agatactatt gataaaagta 
20461 tacaagatgt tattaaaaat aacttaggac aaataagagg tgtgacacct cgtaatgatt 
20521 tactaaatgt taatgtatca gcagaaataa atataaatta tcgtttatct gaattaggtt 
20581 fctattactaa taaaaatgat atggattgga ttaagaaaaa ctatgacttg tattctaaat 
20641 taatagccgg tgcgattcat ggtaagccta taggtggttt ggtagctggt aatgttaaaa 
20701 catcagctaa aaacaaaaaa aatccaccag tgccagcagg ttatacactc gataagaata 
20761 atgtccctta taaaaaagaa caaggcaatt acacagtagc taatgttaaa ggtaataatg 
20821 taagagacgg ttattcaact aattcaagaa ttacaggggt attacccaac aacacaacaa 
20881 ttacgtatga cggtgcatat tgtattaatg gttatagatg gattacttat attgctaata 
20941 gtggacaacg tcgttatata gcgacaggag aggtagacaa ggcaggtaat agaataagta 
21001 gttttggtaa gtttagcacg atttagtatt tacttagaat aaaaattttg ctacattaat 
21061 tatagggaat cttacagtta ttaaataact atttggatgg atgttaatat tcctatacac 
21121 tttttaacat ttctctcaag atttaaatgfc agataacagg caggtacttc ggtacttgcc 
21181 tattttttta tgttatagct agccttcggg ctagtttttt gttatgatgt gttacacatg 
21241 catcaactat ttacatctat ccttgttcac ccaagcatgt cactggatgt tttttcttgc 
21301 gatagagagc atagttttca tactactccc cgtagtatat atgactttag cattcccgta 
21361 taacagttta cggggtgctt ttatgttata atfcgcfctfcta tatagtagga gtgaactata 
21421 tagccgggca gaggccatgt atctgactgt tggtcccaca ggagacatct tccttgtcat 
21481 cactcgatac atatatctta acaacataga aatgttacat tcgctataac cgtatcttaa 
21541 tcgatacggt tatatttatt cccctacaac caacaaaacc acagatccta ttaatttagg 
21601 attgtggtta ttttttgcgt tfcttttgggg caaaaaaagg gcagattatt tgaaaaaggg 
21661 caaacgcttg tggaaaagct aaaaggttaa aaatgacaaa aaccttgata caacagtgfct 
21721 tttggacgct cgtgtacgtt agagaatgac cggtttacca tcatacaagg gtgggattaa 
21781 cttgtgttaa aaagccfctta atatcagttg ttacaaagga tttgtagcgt ctttaaaaat 
21841 aaaaaagggc agaaaaaggg cagatacctt tcagtacaca agtttttcta atttttgctc 
21901 taactctctg tccattttct ctgttacatg tgtatacacc tttatagtcg ttttttcatc 
21961 tgtatgtcct actcttttca taattgcttt taacgatata ttcatttccg ccaataaact 
22021 tatgtgtgta tgccttagtg tgtgagtagt aactttttta ttfcatattta atgattctgc 
22081 agctgaggac aatcgUttgt ttatcctact gccttgcata ggatttcctt ggcaagttgt 
22141 gaatataaac cctctatcaa catagcttgg ttcccattgt tgcatctttt tattttctaa 
22201 cattattttt ttca^tacat ttgctatcct tgaattgatg gcgatzttttc ttcttgaacc 
22261 tgcggtctta gtagtatctt tgtgaccaaa tccagcatta cafcttgattc tgtgaatagt 
22321 gccattaata gcgatcgttfc tatttttgag gtcaacatct ttaacttgga gagctaataa 
22381 ctcacctatg cgcatacctg ttaaagcttg aacttctaca gccccagcaa ctaaaatacg 
22441 agctctatac tgcatgttat tatcgfctcag tataaaatcg cgtatctgta ttacctgttc 
22501 catctctaaa tagttataca ttttcgcttc ttctttttct atatcttcta tcgtcttact 
22561 cttctttggt agtgtgacgc tatttaatat gtgttcgttt ggafcaattgt aaaatttaac 
22621 ggcgtattta atagcttctt tcatatgtcc aagttgacgc tttacctgat ttgcagaata 
22681 tacgtttgat aatttgttaa taaatgtttg catgtacttt gtatcaattt tgtttaaaag 
22741 taaattttga gaactgttct ttttgatgtt tttgattctt gttttcaaat tatcaagcgt 
22801 cgttacttta aagccagatg tttttatatg atattcaagc cattcatcta ataacgcgtg 
22861 aaaagtcaaa gtttttaatt cgcttgacga cttgttgttt agtttttctt ttattttttc 
22921 ttctaaacga aacattgcct ctttttgcga ttgctttgta ttcttattca agacaacact 
22981 tacacgtttc catttatctg tatacggatc tttgtatttc tcgtagtatc tatacttcgt 
23041 ttcattgttc ttatttttaa atttttcaaa ccacatttta catccctcct caaaattggc 
23101 aaaaaataat aagggtaggc gggctaccca tgaaaattgt ataaaaaaag acgcctgtat 
23161 aaaatacaga cgccacttat aattataaga ttacatggtt aattaccaaa aatggtaacg 
23221 aatatatacg tgttttaaag gataaacctt taatatatta aaattatatc atcttatatc 
23281 agggatctgc aatatattat tattaattct atttatcagt aacataatat ccgaagaatc 
23341 tattactgga tttttaattt tttggggtaa aacfctttctt atgcgaaact tactaatcgg 
23401 ctggaaagaa tttatgcaag cgtaactatt accttttaat ttttttacct tatcaattgc 
23461 tgatactatg ttattaatgt ttctgtcaat tttatthaat tt'attttcaa tttctaaact 
23521 atcagatata aattcaataa aataatcttt agtgatgaat tctgtgttgt ttttttggta 
23581 ttttttatcg aaaacttctt ttaatatagc tgaattattt tgcgcgctaa ttaaatttaa 
23641 aaacaatctt aaataatact cccatttcaa atcaaaattc atctttaaat actttttgtt 
23701 ttctttagag gataagggaa taacatttac tatatcctcc gtattagaat catttttatt 
23761 catcactatt gcaaagtgtg aattagaaaa ttctttatta acgtttatac cgaaatctac 
23821 aaaaactatt tctccttgtt taaactttgg ataaaaacct ttatggtttt tttcacct'tc 
23881 aaatctcttg agtaaatagt gaatatctga atctaacttt ttaaattttg gatttccaga 
23941 agtttttaat ttattaatgc gtttttctat attatgcgtc atcatfctctc ctttattctc 
24001 gctcacactc tcaccaccat tcaacgtcta cacttgtagg cgttttttga ttagtaaaat 
24061 cataatgaat cttctttggt taacttatcg ccatctattt tttgtgaaat aaattccaag 
24121 tatttacgcg cattatgtga cgataaatct . ttaggtaact cataagtgaa tggttgatta 
24181 ccactagtta aaacttcata tacGatagtt tctttttfcta ttttgcaatt agttattttc 
24241 attataaact ccttttaaac actgctgaaa tagacgtctt tttcaaataa gcatgattaa 
24301 tactttaatt ctttaatcca catatattta aaagtgaggt agtaggtaat aaatataaga 
24361 cttaaagtta agattgcttt tttcatgtca atttctcctt tgtttatatt tatattaaag 
24421 cgctaaatat acgttattaa tcacaataca actttgccca ttactttaat atcactaaac 
24481 gaagcgactt tgatatcatc atacttcgga tttagagata ccaaattaat atagtcttcg 
24541 catatatcta cacgcttgat aagacttact ccatctaata caacgagtgc aattgtacca 
24601 tctttaatag aatcttcttt cttaataaaa gcgtatgttc cttgttttaa cataggttcc 
24661 attgaatcac cattaactaa aatacaaaaa tcagcatttg atggcgtttc gtcttcttta 
24721 aaaaatactt cttcatgcaa tatgtcatca tataattctt ctcctatgcc agcaccagtt 
24781 gcaccacatg caatatacga tactagttta gactctttat attcatctat agaagtgact 
24841 ttattctgtt catctaattg ctcatttgca tagttaagta cgttttcttg gcggggaggt 
24901 gtgagttgag aaaatatgtt attgattttt gacattatcg tttcatcttg acgttcttcg 
24961 tcaggaactc gataagaatc tacatcatac cccataagcc acgcttcacc gacatttaaa 
25021 gttttagata ataagaataa tttatgttgg tctggagaag accttccatt aacatactgg 
25081 gataagtgac tttttgacat tttaatattc aattcttttt gaaagggttt cgacttttct
25141 agaatatcta cttgacgcaa gttcctatct ttcataattt gttttaatct ttcagaagtg 
25201 ttttgcattg gtaatgcctc cttgaaattc attatatagg aagggaaata aaaatcaata 
25261 caaaagttca acttttttaa ctttttgtgt tgacattgtt caaaattggg gttatagtta 
25321 ttatagttca aatgtttgaa cttaggaggt gattatttga atactaatac aacttttgat 
25381 ttttcgttat tgaacggtaa gatagtcgaa gtgtactcga cacaatttaa ctttgctata 
25441 gctttaggtg tatcagaaag aactttgtct ttgaagttga acaacaaagt accatggaaa
25501 acaacagaca ttattaaagc ttgtaagtta ttgggaatac ctataaaaga tgttcacaaa 
25561 tafcttitttta aacagaaagt tcaaatgttt gaacttaata agtaaaggag gcataacaca 
25621 tgcaagaacg agaaaaggtt aataaaagta acacatcttc aaatgaagca tcaaaacctt 
25681 ttaggacaaa ttgaagctta cgacaaaacg cttaaagaaa taaagtacac tcgagacctt 
25741 tacaacaaac acctaagcat gaacaacgaa gacgcattcg ctggtttgga aatggtagag 
25801 gatgaaatta ctaaaaagct acgaagtgct atcaaagagt tccaaaaagt agtgaaagcg 
25861 ttagacaagc ttaacggtgt tgaaagcgat aacaaagtta ctgatttaac agagtggcgg 
25921 aaagtgaatc agtaacattc acttcttaat ataaccacgc ttatcaacat ccacattgag 
25981 cagatgtgag cgagagctgg cgatgatatg agccgcgttt aaatacattc gatagtcatt 
26041 gcgataaccg tctgctgaat gtgggtgttg aggaaaaagg aggatactca aatgcaagca
26101 ttacaaacat ttaattttaa agagctacca gtaagaacag tagaaattga aaacgaacct 
26161 tattttgtag gaaaagatat tgctgagatt ttaggatatg caagatcaaa caatgccatt 
26221 agaaatcatg ttgatagcga ggacaagctg acgcaccaat ttagtgcatc aggtcaaaac 
26281 agaaatatga tcattatcaa cgaatcagga ttatacagtc taatcttcga tgcttctaaa 
26341 caaagcaaaa acgaaaaaat tagagaaacc gctagaaaat tcaaacgctg ggtaacatca 
26401 gatgtcctac cagctattcg caaacacggt atatacgcaa cagacaatgt aattgaacaa 
26461 acattaaaag atccagacta catcattaca gtgttgactg agtataagaa agaaaaagag 
26521 caaaacttac ttttacaaca gcaagtagaa gttaacaaac caaaagtatt attcgctgac 
26581 tcggtagctg gtagtgataa ttcaatactt gttggagaac tagcgaaaat acttaaacaa 
26641 aacggtgttg atataggaca aaacagattg ttcaaatggt taagaaataa tggatatctc 
26701 attaaaaaga gtggagaaag ttataactta ccaactcaaa agagtatgga tctaaaaatc 
26761 ttggatatca aaaaacgaat aattaataat ccagatggtt caagtaaagt atcacgtaca 
26821 ccaaaagtaa caggcaaagg acaacaatac tttgttaata agtttttagg agaaaaacaa
26881 acatcttaaa aggaggaaca caatggaaca aatcacatta accaaagaag agttgaaaga 
26941 aattatagca aaagaagtta gagaggctat aaatggcaag aaaccaatca gttcaggttc 
27001 aattttcagt aaagtaagaa tcaataatga cgatttagaa gaaatcaata aaaaactcaa
27061 tttcgcaaaa gatttgtcgc taggaagatt gaggaagctc aatcatccga ttccgctaaa 
27121 aaagtatcag catggcttcg aatcaattca tcaaaaagct tatgtacaag atgttcatga
27181 ccatattaga aaattaacat tatcaatttt tggagtgaca cttaattcag acttgagtga
27241 aagtgaatac aacctagcag caaaagttta tcgagaaatc aaaaactatt atttatacat
27301 ctatgaaaag agagtttcag aattaactat cgatgatttc gaataaagga ggaacaacaa 
27361 atgttacaaa aatttagaat tgcgaaagaa aaaaataaat taaaactcaa attactcaag 
27421 catgctagtt actgtttaga aagaaacaac aaccctgaac tgttgcgagc agttgcagag 
27481 ttgttgaaaa aggttagcta aattcaacgg taaggatttg ccctgcctcc acacttagag 
27541 tttgagatcc aacaaacaca taagttttag tagggtctag aaaaaatgtt tcgatttcct 
27601 cttttgtaac agtttcaatt ccttcatatc ctggaaaaac aattttcttt aaatccgaaa
27661 catgtttttt tgaaccatcc tttaaagtaa ctagaagttt catacttatc acctccttag 
27721 gttgataaca acattataca cgaaaggagc ataaacaata tgcaagcatt acaaacaaat 
27781 tcgaacatcg gagaaatgtt caatattcaa gaaaaagaaa atggagaaat cgcaatcagc
27841 ggtcgagaac ttcatcaagc attagaagtt aagacagcat ataaagattg gtttccaaga 
27901 atgcttaaat acggatttga agaaaataca gattacacag ctatcgctca aaaaagagca 
27961 acagctcaag gcaatatgac tcactatatt gaccacgcac tcacactaga cactgcaaaa
28021 gaaatcgcaa tgattcaacg tagtgaacct ggcaaacgtg caagacaata tttcatccaa 
28081 gttgaaaaag catggaacag cccagaaatg attatgcaac gtgctttaaa aattgctaac 
28141 aacacaatca atcaattaga aacaaagatt gcacgtgaca aaccaaaaat tgtatttgca 
28201 gatgcagtag ctactactaa gacatcaatt ttagttggag agttagcaaa gatcattaaa 
28261 caaaacggta taaacatcgg gcaacgcaga ttgtttgagt ggttacgtca aaacggattc 
28321 cttattaaac gcaagggtgt ggattataac atgcctacac agtattcaat ggaacgtgag 
28381 ttattcgaaa ttaaagaaac atcaatcaca cattcggacg gtcacacatc aattagtaag
28441 acgccaaaag taacaggtaa aggacaacaa tactttgtta acaagttttt aggagaaaaa 
28501 caaacaactt aataggagga attacaaatg aacgcactat acaaaacaac cctcctcatc 
28561 acaatggcag ttgtgacgtg gaaggtttgg aagattgaga agcacactag aaaacctgtg 
28621 attagtagca gggcgttgag tgactatcta aacaacaaat ctttaaccat accgaaagat
28681 gctgaaaatt ctactgaatc tgctcgtcgc cttttgaagt tcgccgaaca aactattagc 
28741 aaataacaac attatacacg aaaggaaaga tagaaatgcc aaaaatcata gtaccaccaa 
28801 caccagaaaa cacatataga ggcgaagaaa aatttgtgaa aaagttatac gcaacaccta
28861 cacaaatcca tcaattgttt ggagtatgta gaagtacagt atacaactgg ttgaaatatt 
28921 accgcaaaga taatttaggt gtagaaaatt tatacattga ttattcacca acaggcactc 
28981 tgattaatat ttctaaattg gaagagtatt tgatcagaaa gcataaaaaa tggtattagg
29041 aggatattaa atgagcaaca tttataaaag ctacctagta gcagtattat gcttcacagt 
29101 cttagcgatt gtacttatgc cgtttctaba cttcactaca gcatggtcaa ttgcgggatt
29161 cgcaagtatc gcaacattca tgtactacaa agaatgcttt ttcaaagaat aaaaaaactg 
29221 ctacttgttg gagcaagtaa cagtatcaaa cacttaagaa aaaattcatg ttcaatataa 
29281 aacgaaaaac ggaggaagtc aagatgtatt acgaaatagg cgaaatcata cgcaaaaata 
29341 ttcatgttaa cggattcgat tttaagctat tcattttaaa aggtcatatg ggcatatcaa 
29401 tacaagttaa agatatgaac aacgtaccaa ttaaacatgc ttatgtcgta gatgagaatg 
29461 acttagatat ggcatcagac ttatttaacc aagcaataga tgaatggatt gaagagaaca 
29521 cagacgaaca ggacagacha attaacttag tcatgaaatg gtaggaggtc gctatgaagc 
29581 agactgtaac ttatatcatt cgtcataggg atatgccaat ttatataact aacaaaccaa 
29641 ctgataacaa ttcagatatt agttactcca caaatagaaa tagagctagg gagtttaacg 
29701 gtatggaaga agcgagtatc aatatggatt atcacaaagc aatcaagaaa acagtgacag 
29761 aaactatfcga gtacgaggag gtagaacatg actgaggaaa aacaagaacc acaagaaaaa 
29821 gtaagcatac tcaaaaaact aaagataaat aatatcgctg agaaaaataa aaggaaattc
29881 tataaatttg cagtatacgg aaaaattggc tcaggaaaaa ccacgtttgc tacaagagat 
29941 aaagacgctt tcgtcattga cattaacgaa ggtggaacaa cggttactga cgaaggatca 
30001 gacgtagaaa tcgagaacta tcaacacttt gtttatgttg taaatttttt acctcaaatt
30061 ttacaggaga tgagagaaaa cggacaagaa atcaatgttg tagttattga aactattcaa 
30121 aaacttagag atatgacatt gaatgatgtg atgaaaaata agtctaaaaa accaacgttt 
30181 aatgattggg gagaagttgc tgaacgaatt gtcagtatgt acagattaat aggaaaactt 
30241 caagaagaat acaaattcca ctttgttatt acaggtcatg aaggtatcaa caaagataaa 
30301 gatgatgaag gtagcactat caaccctact atcactattg aagcgcaaga acaaattaaa 
30361 aaagctatta cttctcaaag tgatgtgtta gctagggcaa tgattgaaga atttgatgat 
30421 aacggagaaa agaaagctag atatattcta aacgctgaac cttctaatac gtttgaaaca 
30481 aagattagac attcaccttc aataacaatt aacaataaga aatttgcaaa tcctagcatt 
30541 acggacgtag tagaagcaat tagaaatgga aactaaaaat taattaaaag gacggtattt 
30601 aattatgaaa atcaoaggac aagcgcaatt tactaaagaa acaaatcaag aaaagtttta
30661 taacggctca gcagggtttc aagctggaga attcacagtg aaagttaaaa atattgaatt
30721 caatgataga gaaaatagat atttcacaat cgtatttgaa aatgatgaag gcaaacaata
30781 taaacataat caatttgtac cgccgtataa atatgatttc caagaaaaac aatcgattga
30841 attagttact cgattaggta ttaagttaaa tcttcctagc ttagattttg ataccaatga
30901 tcttattggt aagttttgtc acttggtatt gaaatggaaa ttcaatgaag atgaaggtaa
30961 gtattttacg gatttttcat ttattaaacc ttacaaaaag ggcgatgatg ttgttaacaa
31021 acctattccg aagacagata agcaaaaagc tgaagaaaat aacggggcac aacaacaaac
31081 atcaatgtct caacaaagca atccatttga aagcagtggc caatttggat atgacgacca
31141 agatttagcg ttttaaggtg tggtttaaat gcaatacatt acaagatacc agaaagataa
31201 cgacggtact tattccgtcg ttgctactgg tgttgaactt gaacaaagtc acattgactt
31261 actagaaaac ggatatccac taaaagcaga agtagaggtt ccggacaata aaaaactatc
31321 tatagaacaa cgcaaaaaaa tattcgcaat gtgtagagat atagaacttc actggggcga
31381 accagtagaa tcaactagaa aattattaca aacagaattg gaaattatga aaggttatga
31441 agaaatcagt ctgcgcgact gttctatgaa agttgcaagg gagttaatag aactgattat
31501 agcgtttatg tttcatcatc aaatacctat gagtgtagaa acgagtaagt tgttaagcga
31561 agataaagcg ttattatatt gggctacaat caaccgcaac tgtgtaatat gcggaaagcc
31621 tcacgcagac ctggcacatt atgaagcagt cggcagaggc atgaacagaa acaaaatgaa
31681 ccactatgac aaacatgtat tagcgttatg tcgcgaacat cacaacgagc aacatgcgat
31741 tggcgttaag tcgtttgatg ataaatacca cttgcatgac tcgtggataa aagttgatga
31801 gaggctcaat aaaatgttga aaggagagaa aaaggaatga atagactaag aataataaaa
31861 atagcactcc taatcgtcat cttggcggaa gagattagaa atgctatgca tgctgtaaaa
31921 gtggagaaaa ttttaaaatc tccgtttagt taatacaggt ttttacaaaa gctttaccat
31981 aggcggacaa actaattgag ccttttttga tgtctattac ccaggggctg taatgtaact
32041 ttaatacttc aaattcaatg ccagaaagtt tacttattgt ttctaggttg tgtcctgact
32101 ttaacattct tttaacaaat tctaatcccg aaacaaatct ttgtttttct ataatcttat
32161 taaagtgatt taaaaactga ggagcataaa acttattata aattcctttt tttgttaagt
32221 aagacatgtc aaaagtttca tttaaaaccc ctaaccttac taggttatta attgaaattt
32281 cggttgattc tatatctaac ggagagtctt ttattaacgt gtccgatata ttcataccgt
32341 cattctttgg gtttaaaacc gctctatatt taacggcagg atgtacttcg tgattcttta
32401 aatgttttaa aagaatagca tcatttgggg ataattgttt aattatttca acaaatgaat
32461 ggtgggttaa tgagtttttt ctgtcatcca tagatgatgc tattagtttt gcgaacatat
32521 tacttaaagt tttttcacta atgtaaaact ttgaagcttc tagagcagga cctagaagag
32581 aaaattgtgg ttcttgtaaa ttatttttag gtacagaaga tatttctttt ttaaattgtt
32641 ctttgaattt ttcaaattct acttctcttt gataaataac tttatccaca taaaggtgga
32701 atttcccaaa gacaagttcc caagttttag agaatgtttc tacaggccct tttgatgcgc
32761 cttcaataat tttatcaata cctttaccta aaataggatc cataattatt cacccccaat
32821 ctaacgcaat agcgataata aaattatacc agaaaggaga atcaacatga ctgaccaacc
32881 aagttactac tcaataatta cagcaaatgt cagatacgat aaccgactta ctgacagcga
32941 aaagttactt tttgcagaaa taacatcttt aagtaacaaa tacggatact gcacagcaag
33001 taatggttac tttgcaactt tatacaacgt tgttaaggaa actatatctc gtagaatttc
33061 gaaccttacc aactttggtt atctaaaaat cgaaattatc aaagaaggta atgaagttaa
33121 acaaaggaag atgtacccct tgacgcaaac gtcaatacct attgacgcaa aaatcaatac
33181 ccctattgat aattctgtca atacccctat tgacgcaaat gtcaaagaga atattacaag
33241 tattaataat acaagtaata acaatataaa tagaatagat atattgtcgg gcaacccgac
33301 agcatcttct ataccctata aagaaattat cgattactta aacaaaaaag cgggcaagca
33361 ttttaaacac aatacagcta aaacaaaaga ttttattaaa gcaagatgga atcaagattt
33421 taggttggag gattttaaaa aggtgattga tatcaaaaca gctgagtggc taaacacgga
33481 tagcgataaa taccttagac cagaaacact ttttggcagt aaatttgagg ggtacctcaa
33541 tcaaaaaata caaccaactg gcacggatca attggaacgc atgaagtacg acgaaagtta
33601 ttgggattag ggggatatta tgaaaccact attcagcgaa aagataaacg aaagcttgaa
33661 aaaatatcaa cctactcatg tcgaaaaagg attgaaatgt gagagatgtg gaagtgaata
33721 cgacttatat aagtttgctc ctactaaaaa acacccgaat ggttacgagt ataaagacgg
33781 ttgcaaatgt gaaatctatg aggaatataa gcgaaacaag caacggaaga taaacaacat
33841 attcaatcaa tcaaacgtta atccgtcttt aagagatgca acagtcaaaa actacaagcc
33901 acaaaatgaa aaacaagtac acgctaaaca aacagcaata gagtacgtac aaggcttctc
33961 tacaaaagaa ccaaaatcat taatattgca aggttcatac ggaactggta aaagccacct
34021 agcatacgct atcgcaaaag cagtcaaagc taaagggcat acggttgctt ttatgcacat
34081 accaatgttg atggatcgta tcaaagcgac atacaacaaa aatgcagtag agactacaga 
34141 cgagctagtc agattgctaa gtgatattga tttacttgta ctagatgata tgggtgtaga 
34201 aaacacagag cacactttaa ataaactttt cagcattgtt gataacagag taggtaaaaa 
34261 caacatcttt acaactaact ttagtgataa agaactaaat caaaatatga actggcaacg 
34321 tataaattcg agaatgaaaa aaagagcaag aaaagtaaga gtaatcggag acgatttcag 
34381 ggagcgagat gcatggtaac caaagaattt ttaaaaacta aacttgagtg ttcagatatg
34441 tacgctcaga aactcataga tgaggcacag ggcgatgaaa ataggttgta cgacctattt 
34501 atccaaaaac ttgcagaacg tcatacacgc cccgctatcg tcgaatatta aggagtgtta 
34561 aaaatgccga aagaaaaata ttacttatac cgagaagatg gcacagaaga tattaaggtc 
34621 atcaagtata aagacaacgt aaatgaggtt tattcgctca caggagccca tttcagcgac 
34681 gaaaagaaaa ttatgactga tagtgaccta aaacgattca aaggcgctca cgggcttcta 
34741 tatgagcaag aattaggttt acaagcaacg atatttgata tttagaggtg gacgatgagt 
34801 aaatacaacg ctaagaaagt tgagtacaaa ggaattgtat ttgatagcaa agtagagtgt 
34861 gaatattacc aatatttaga aagtaatatg aatggcacta attatgatca tatcgaaata 
34921 caaccgaaat tcgaattatt accaaaacta gataaacaac gaaagattga atatattgca 
34981 gacttcgcgt tatatctcga tggcaaactg attgaagtta tcgacattaa aggtatgcca 
35041 accgaagtag caaaacttaa agctaagatt ttcagacata aatacagaaa cataaaactc 
35101 aattggatat gtaaagcgcc taagtataca ggtaaaacat ggattacgta cgaggaatta 
35161 attaaagcaa gacgagaacg caaaagagaa atgaagtgat ctaatgcaac aacaagcata 
35221 tataaatgca acgattgata taaggatacc tacagaagtt gaatatcagc attttgatga 
35281 tgtggataaa gaaaaagaag cgctggcaga ttacttatat aacaatcctg acgaaatact 
35341 agagtatgac aatttaaaaa ttagaaacgt aaatgtagag gtggaataaa tgggcagtgt 
35401 tgtaatcatt aataataaac catataaatt taacaatttt gaaaaaagaa ataatggcaa 
35461 agcgtgggat aaatgctgga attgtttcta aacgtgttag aggttgttgg gagttttcag 
35521 aagctttaga cgcgccttat ggcatgcacc taaaagaata bagagaaatg aaacaaatgg
35581 aaaagattaa acaagcgaga ctcgaacgtg aattggaaag agagcgaaag aaagaggctg 
35641 agctacgtaa gaagaagcca catttgttta atgtacctca aaaacattca cgtgatccgt 
35701 actggttcga tgtcacttat aaccaaatgt tcaagaaatg gagtgaagca taatgagcat 
35761 aatcagtaac agaaaagtag atatgaacaa aacgcaagac aacgttaagc aacctgcgca 
35821 ttacacatac ggcgacattg aaattataga ttttattgaa caagttacgg cacagtaccc 
35881 accacaatta gcattcgcaa taggtaatgc aattaaatac ttgtctagag caccgttaaa 
35941 gaatggtcat gaggatttag caaaggcgaa gttttacgtc gatagagtat ttgacttgtg 
36001 ggagtgatga ccatgacaga tagcggacgt aaagaatact taaaacattt tttcggctct 
36061 aagagatatc tgtatcagga taacgaacga gtggcacata tccatgtagt aaatggcact 
36121 tattactttc acggtcatat cgtgccaggt tggcaaggtg tgaaaaagac atttgataca 
36181 gcggaagagc ttgaaacata tataaagcaa agtgatttgg aatatgagga acagaagcaa 
36241 ctaactttat tttaaaaggg cggaaacaat gaaaatcaaa attgaaaaag aaatgaattt 
36301 acctgaactt atccaatggg cttgggataa ccccaagtta tcaggtaata aaagattcta 
36361 ttcaaatgat gttgagcgca actgttttgt gacttttcat gttgatagca tcttatgtaa 
36421 tgtgactgga tatgtatcaa ttaacgataa atttactgtt caagaggaga tataacaatg 
36481 aaaatcaaag ttaaaaaaga aatgagatta gatgaattaa ttaaatgggc gcgagaaaat 
36541 ccggatctat cacaaggaaa aatatttttt tcaacaggat ttagtgatgg attcgttcgt 
36601 tttcatccaa atacaaataa gtgttcgacg tcaagtttta ttccaattga tatccccttc 
36661 atagttgata ttgaaaaaga agtaacggaa gagactaagg ttgataggtt gattgaatta 
36721 ttcgagattc aagaaggaga ctataactct acactatatg agaacactag tataaaagaa 
36781 tgtttatatg gcagatgtgt gcctaccaaa gcattctaca tcttaaacga tgacctaact 
36841 atgacgttaa tctggaaaga tggggagttg ctagtatgat gttgaaattt aaagcttggg 
36901 ataaagataa aaaagttatg agtattattg acgaaatcga ttttaatagt gggtacattt
36961 tgatttcaac aggttataaa agtttcaatg aagtaaaact attacaatac acaggattta 
37021 aagatgtgca cggtgtggag atttatgaag gggatattgt tcaagattgt tattcgagag 
37081 aagtaagttt tatcgagttt aaagaaggag ccttttatat aacttttagc aatgtaactg
37141 aattactaag tgaaaatgac gatattattg aaattgttgg aaatattttt gaaaatgaga 
37201 tgctattgga ggttatgaga tgacgttcac cttatcagat gaacaatata aaaatctttg
37261 tactaactct aacaagttat tagataaact tcacaaagca ttaaaagatc gtgaagagta 
37321 caagaagcaa cgagatgagc ttattgggga tatagcgaag ttacgagatt gtaacaaaga 
37381 tctagagaag aaagcaagcg catgggatag gtattgcaag agcgttgaaa aagatttaat 



WO 01/46383

PCT/US00/35180



37441 aaacgaattc ggtaacgatg atgaaagagt taaattcgga atggaattaa acaataaaat 
37501 ttttatggag gatgacacaa atgaataatc gcgaaaaaat cgaacagtcc gttattagtg 
37561 ctagtgcgta taacggtaat gacacagagg ggttgctaaa agagattgag gacgtgtata
37621 agaaagcgca agcgtttgat gaaatacttg agggaatgac aaatgctatt caacatccag
37681 ttaaagaagg tattgaactt gatgaagcag tagggattat ggcaggtcaa gttgtctata 
37741 aatatgagga ggaataggaa aatgactaac acattacaag taaaactatt atcaaaaaat 
37801 gctagaatgc ccgaacgaaa tcataagacg gatgcaggtt atgacatatt ctcagctgaa 
37861 actgtcgtac tcgaaccaca agaaaaagca gtgatcaaaa cagatgtagc tgtgagtata 
37921 ccagagggct atgtcggact attaactagt cgtagtggtg taagtagtaa aacgtattta 
37981 gtgattgaaa caggcaagat agacgcggga tatcatggca atttagggat taatatcaag 
38041 aatgatgaag aacgtgatgg aatacccttt ttatatgatg atatagacgc tgaattagaa 
38101 gatggattaa taagcatttt agatataaaa ggtaactatg tacaagatgg aagaggcata 
38161 agaagagttt accaaatcaa caaaggcgat aaactagctc aattggttat cgtgcctata 
38221 tggacaccgg aactaaagca agtggaggaa ttcgaaagtg tttcagaacg tggagcaaaa
38281 ggcttcggaa gtagcggagt gtaaagacat cttagatcga gttaaggagg ttttggggaa 
38341 gtgacgcaat acttagtcac aacattcaaa gatbcaacag gacgaccaca tgaacatatt 
38401 actgtggcta gagataatca gacgtttaca gttattgagg cagagagtaa agaagaagcg 
38461 aaagagaagt acgaggcaca agttaaaaga gatgcagtta ttaaagtggg tcagttgtat 
38521 gaaaatataa gggagtgtgg gaaatgacgg atgttaaaat taaaactatt tcaggtggag 
38581 tttattttgt aaaaacagct gaaccttttg aaaaatatgt tgaaagaatg acgagtttta 
38641 atggttatat ttacgcaagt actataatca agaaaccaac gtatattaaa acagatacga 
38701 ttgaatcaat cacacttatt gaggagcatg ggaaatgaat cagctgagaa ttttattaca 
38761 tgacggtagt agtttgatat tacatgaaga tgaattattt aacgaaatag tatttgtttt
38821 ggacaatttt agaaatgatg atgactattt aacgatagaa aaagattatg gcagagaact 
38881 tgtattgaac aaaggttata tagttgggat caatgttgag gaggcagatg atgattaaca 
38941 tacctaaaat gaaattcccg aaaaagtaca ctgaaataat caaaaaatat aaaaataaag 
39001 cacctgaaga aaaggctaag attgaagatg attttattaa agaaattaaa. gataaagaca 
39061 gtgaatttta cagtcctacg atggctaata tgaatgaata tgaattaagg gctatgttaa 
39121 gaatgatgcc tagtttaatt gatactggag atgacaatga tgattaaaaa acttaaaaat 
39181 atggatgggt tcgacatctt tattgttgga atactgtcat tattcggtat attcgcattg
39241 ctacttgtta tcacattgcc tatctataca gtggctagtt iaccaacacaa agaattacat 
39301 caaggaacta ttacagataa acataacaag agacaagata aagaagacaa gttctatatt
39361 gtattagaca acaaacaagt cattgaaaat tccgacttat tattcaaaaa gaaatttgat 
39421 agcgcagata tacaagctag gttaaaagta ggcgataagg tagaagttaa aacaatcggt 
39481 tatagaatac actttttaaa tttatatccg gtcttatacg aagtaaagaa ggtagataaa 
39541 caatgattaa acaaatacta agactattat tcttactagc aatgtatgag ttaggtaagt 
39601 atgtaactga gcaagtgtat attatgatga cggctaatga tgatgtagag gcgccgagtg 
39661 attacgtctt tcgagcggag gtgagtgaat aatgagaata tttatttatg atttgatcgt 
39721 tttgctgttt gctttcttaa tatccatata tattattgat gatggagtga taataaatgc
39781 attaggaatt tttggtatgt ataaaattat agattccttt tcagaaaata ttataaagag 
39841 gtagataaaa atgaacgagc aaataatagg aagcatatat actttagcag gaggtgttgt 
39901 gctttattca gttaaagaga tttttaggta ttttacagat tctaacttac aacgtaaaaa 
39961 aatcaattta gaacaaatat atccgatata tttagattgt tttaaaaagg ctaaaaagat 
40021 gattggagct tatattattc caacagaaca gcatgaattt ttagattttt ttgatattga 
40081 agtctttaat aatttagata agcaaagtaa aaaagcgtat gaaaatgtta ttggatttag 
40141 acaaatgatt aatttatcaa atagagttaa ggcaatggaa gattttaaga tgagtttcaa 
40201 caatgaattt agtacaaatc agattttttt taaitccttct tttgttatgg aaacaattgc 
40261 tattataaat gaatatcaaa aagatatatc ttatttaaaa aatataatta ataaaatgaa 
40321 tgaaaataga gcttataatc atattgatag ttttatcact tcagagtacc gacgaaaaat 
40381 aaacgattat aatctttatc ttgataaatt tgaagaacag tttagtcaaa agtttaaaat 
40441 aaacagaact tcgataaaag aaagaattat tattaattta aacaagagga gatttaaatg 
40501 atgtggatta ctatgactat tgtatttgct atattgctat tagtttgtat cagtattaat 
40561 agtgatcgtg caagagagat acaagcactt agatatatga atgattatct acttgatgaa 
40621 gtagttaaaa ctaaagggta caacgggtta gaagaataca ggattgaatt gaagcgaatg 
40681 aataacgata ttaaaaagta atttatatta tcggaggtat tgcattgaat gataaagatt 
40741 gagaaacacg atatcaaaaa gcttgaagaa tacattcagc acatcgataa ctatcgaaga 
40801 gagttgaaga tgcgagaata tgaattactt gaaagtcatg aaccagataa tgcgggagct 



40861 ggcaaaagta atttgccggg taacccgatt gaacgatgtg caataaagaa gtttagtgat 
40921 aacaggtaca atacattaag aaatatagtt aacggtgtag atagattgat aggtgaaagt 
40981 gatgaggata cgcttgagtt attaaggttt agatattggg attgtcctat tggttgttat 
41041 gaatgggaag atatagcaca ttactttggt acaagtaaga caagtatatt acgtagaagg 
41101 aatgcactga tcgataagtt agcaaagtat attggttatg tgtagcggac ttttacccta 
41161 tgtaagtccg cattaaaaca gtttattatg ttagtatcag attaatattt aaagttatta 
41221 aatgctaata cgacgcatga acaagaggcg catcactatg tgatgtgtct ttttatttat
41281 gaggtatgaa catgttcaaa ctaattgtaa atacattact acacatcaag tatagatgag
41341 tcttgatact acttaagtta tataaggtga aacattatga tgactaaaga cgaacgtata 
41401 cgattctata agtctaaaga atggcaaata acaagaaaaa gagtgctaga aagagataat 
41461 tatgaatgtc aacaatgtaa gagagacggc aagttaacga catatgacaa aagcaagcgt 
41521 aagtcgttgg atgtagatca tatattatcg ctagaacatc atccggagtt tgctcatgac 
41581 ttaaacaatt tagaaacact gtgtattaaa tgtcacaaca aaaaagaaaa gagatttata 
41641 aaaaaagaaa ataaatggaa agacgaaaaa tggtaaatac ccccgggtca aaaaaatcaa 
41701 aagcgatc 






SEQ ID NO: 4

1 atggtaacca aagaattttt aaaaactaaa cttgagtgtt cagatatgta cgctcagaaa
61 ctcatagatg aggcacaggg cgatgaaaat aggttgtacg acctatttat ccaaaaactt
121 gcagaacgtc atacacgccc cgctatcgtc gaatattaa



SEQ ID NO: 5

1 MVTKEFLKTK LECSDMYAQK LIDEAQGDEN RLYDLFIQKL AERHTRPAIV EY
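The relationship between the nucleotide listing of SEQ ID NO: 4 and the peptide of SEQ ID NO: 5 can be checked with a short translation sketch. This is an illustration only: it assumes the standard genetic code, and it types the fifth 10-base group of the first row as `cagatatgta`, the reading that agrees with the matching stretch of the contig listing (positions 34393 to 34551). The function and variable names are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces/newlines from the listing layout
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# SEQ ID NO: 4 (159 bases), typed under the assumption stated above.
SEQ_ID_NO_4 = """
atggtaacca aagaattttt aaaaactaaa cttgagtgtt cagatatgta cgctcagaaa
ctcatagatg aggcacaggg cgatgaaaat aggttgtacg acctatttat ccaaaaactt
gcagaacgtc atacacgccc cgctatcgtc gaatattaa
"""

peptide = translate(SEQ_ID_NO_4)
print(len(peptide), peptide)
```

The same `translate` helper could be applied to the dnaC nucleotide listings (SEQ ID NO: 6 and 7) to compare against the corresponding amino acid listings.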



Predicted Tryptic Peptide Masses of Conceptual ORF in Contig 1 383:



1 MGGGQS IMKq fkS I INTSQDFEKr lEK-i KK 30
31 evindpdvkQFLEAHRaef inamided ( nv 60
61 I qeykDQQKhydghkFADCPNFVK g h v p e I 90
91 y V d n n r J KirYLQCPCKikYDEERfeaeli 120
121 t s n n m q rDTLNAKI kD lYM.NHRdrLDVAMA 150
151 ADDICTAITNGEQVKglylygpfgtgkSFI 180
181 LGAlANQLKskKvr S T I i Y L P E F i R I I k G G 210
211 FKdgsfekKihrVReani Imlaaigaeevt 240
241 pwvrDEVIGPLLHYRmvheiptffssnfdy 270
271 selehhiamt rDGEEKtkAARiierVK s I s 300
301 t p y f I s g e n f r N N 313



Tryptic peptide fragment: 
GHVPELYVDNNR 
STIIYLPEFIR
SLSTPYFLSGENFR 



Predicted Peptide Mass MH+ = 1413.538
Predicted Peptide Mass MH+ = 1351.6221
Predicted Peptide Mass MH+ = 1618.7923 
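A predicted tryptic peptide mass of this kind can be approximated as follows. This is a minimal sketch, not the method used to produce the values above. It assumes the usual trypsin rule (cleave after K or R unless the next residue is P) and average residue masses from standard reference tables. The listed values for GHVPELYVDNNR and SLSTPYFLSGENFR appear consistent with average rather than monoisotopic masses, but that is an inference. All function names and the short test sequence are hypothetical.

```python
# Average residue masses in Da (standard reference values, not from the source).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153   # added once per peptide for the free termini
PROTON = 1.0073   # charge carrier for the singly protonated ion


def tryptic_digest(protein):
    """Split after K or R, except when the following residue is P."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def mh_plus(peptide):
    """Average mass of the singly protonated peptide, [M+H]+."""
    return sum(AVERAGE_RESIDUE_MASS[r] for r in peptide) + WATER + PROTON


# Hypothetical stretch containing one of the listed fragments.
for fragment in tryptic_digest("AKGHVPELYVDNNRIK"):
    print(fragment, round(mh_plus(fragment), 3))
```

Because leucine and isoleucine have the same mass, this calculation cannot distinguish an L from an I in a fragment.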
SEQ ID NO: 6 DnaC nucleotide B. subtilis



1 atgacagacc ttctgaatga ccggcttcct ccgcaaaata tagaagccga
51 acaagccgtg ttaggcgcta tttttttaca gccgtctgct ttaacactgg
101 cttcagaagt attgattcca gatgatttct atagaatgtc ccaccaaaaa
151 atctataatg cgatgctggt gctcggtgac cgaggtgaac cggttgatct
201 ggtgacagtt acatcagagc ttgcgaacac agacctgctg gaagaagtag
251 gcggtatttc atatttgaca gatatcgcaa actcggtgcc gacagcggct
301 aacatagaat attacgcgaa aatcgttgag gaaaaatcga ttcttcgccg
351 attaatcaga actgcgacaa cgattgctca agacgggtat acccgtgagg
401 atgaggtcga ggatttactc agtgaagcgg aaaaaacgat tatggaagtg
451 gcacagcgca aaaacacgag tgccttccaa aatattaagg acgtccttgt
501 ccagacctat gataatatcg aacagcttta caatcgaaaa ggtgatatca
551 cgggaattcc aacagggttt acggagcttg accggatgac tgcgggtttc
601 cagcgcaacg acttgatcat tgtggctgcc cgtccttcag tagggaaaac
651 agcctttgcc ctgaacatcg cacaaaacgt ggcgacgaag accgatgaga
701 gcgtagcgat tttcagtctt gagatgggtg ccgagcagct cgttatgcgt
751 atgctctgtg ccgagggaaa tatcaatgcc cagaatctcc gtacaggtaa
801 cctgaccgaa gaggattggg gcaagctgac gatggcaatg ggaagcctat
851 cgaacagcgg gatttacatc gatgatacac cgggtattcg agtgagtgaa
901 atccgtgcca agtgccgccg cttgaagcag gaaagcgggc tgggcatgat
951 tttgatcgat tacctgcaat tgattcaggg aagcggtcgt tcaaaggaca
1001 accgtcagca ggaagtatct gaaatttccc gtgaactgaa gtcgattgcg
1051 agggagctgc aagtccctgt tatcgcgctt tctcagcttt ccaggggtgt
1101 tgagcagcgt caggataaac gtccgatgat gtctgatatc cgggaatcag
1151 gaagtatcga gcaggacgcg gatattgtcg cgttccttta tcgtgatgac
1201 tactatgaca aagaaaccga gaataaaaat attatcgaaa ttattatcgc
1251 caaacagcgt aacggcccgg taggaaccgt gtctcttgcg ttcgtaaaag
1301 aatacaacaa attcgtcaac ctggaacggc gttttgatga cgcaggcgtt
1351 ccgcccggcg ca



SEQ ID NO: 7 DnaC nucleotide S. aureus

1 ATGGATAGAA TGTATGAGCA AAATCAAATG CCGCATAACA ATGAAGCTGA
51 ACAGTCTGTC TTAGGTTCAA TTATTATAGA TCCAGAATTG ATTAATACTA
101 CTCAGGAAGT TTTGCTTCCT GAGTCGTTTT ATAGGGGTGC CCATCAACAT
151 ATTTTCCGTG CAATGATGCA CTTAAATGAA GATAATAAAG AAATTGATGT
201 TGTAACATTG ATGGATCAAT TATCGACGGA AGGTACGTTG AATGAAGCGG
251 GTGGCCCGCA ATATCTTGCA GAGTTATCTA CAAATGTACC AACGACGCGA
301 AATGTTCAGT ATTATACTGA TATCGTTTCT AAGCATGCAT TAAAACGTAG
351 ATTGATTCAA ACTGCAGATA GTATTGCCAA TGATGGATAT AATGATGAAC
401 TTGAACTAGA TGCGATTTTA AGTGATGCAG AACGTCGAAT TTTAGAGCTA
451 TCATCTTCTC GTGAAAGCGA TGGCTTTAAA GACATTCGAG ACGTCTTAGG
501 ACAAGTGTAT GAAACAGCTG AAGAGCTTGA TCAAAATAGT GGTCAAACAC
551 CAGGTATACC TACAGGATAT CGAGATTTAG ACCAAATGAC AGCAGGGTTC
601 AACCGAAATG ATTTAATTAT CCTTGCAGCG CGTCCATCTG TAGGTAAGAC
651 TGCGTTCGCA CTTAATATTG CACAAAAAGT TGCAACGCAT GAAGATATGT
701 ATACAGTTGG TATTTTCTCG CTAGAGATGG GTGCTGATCA GTTAGCCACA
751 CGTATGATTT GTAGTTCTGG AAATGTTGAC TCAAACCGCT TAAGAACGGG
801 TACTATGACT GAGGAAGATT GGAGTCGTTT TACTATAGCG GTAGGTAAAT
851 TATCACGTAC GAAGATTTTT ATTGATGATA CACCGGGTAT TCGAATTAAT
901 GATTTACGTT CTAAATGTCG TCGATTAAAG CAAGAACATG GCTTAGACAT
951 GATTGTGATT GACTACTTAC AGTTGATTCA AGGTAGTGGT TCACGTGCGT
1001 CCGATAACAG ACAACAGGAA GTTTCTGAAA TCTCTCGTAC ATTAAAAGCA
1051 TTAGCCCGTG AATTAGAATG TCCAGTTATC GCATTAAGTC AGTTATCTCG
1101 TGGTGTTGAA CAACGACAAG ATAAACGTCC AATGATGAGT GATATTCGTG
1151 AATCTGGTTC GATTGAGCAA GATGCCGATA TCGTTGCATT CTTATACCGT
1201 GATGATTACT ATAACCGTGG CGGCGATGAA GATGATGACG ATGATGGTGG
1251 TTTCGAGCCA CAAACGAATG ATGAAAACGG TGAAATTGAA ATTATCATTG
1301 CTAAGCAACG TAACGGTCCA ACAGGCACAG TTAAGTTACA TTTTATGAAA
1351 CAATATAATA AATTTACCGA TATCGATTAT GCACATGCAG ATATGATGTA
1401 A



dnaC nucleotide sequences alignment



Identical = 785/1413 (0.56), similar = 785/1413 (0.56)
Substitution matrix: 10 (matches), 0 (mismatches)
Gap penalty: - (50 + 3 * (gap length))
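The scoring parameters stated above can be illustrated with a small sketch that scores an already-aligned pair of sequences: 10 per identical column, 0 per mismatch, and a penalty of 50 + 3 × (gap length) for each run of gap characters. This is a hypothetical illustration of how such parameters combine, not the alignment program used to produce the figures in this listing. The function names and the toy sequences are invented for the example.

```python
from itertools import groupby


def column_kind(x, y):
    """Classify one alignment column."""
    if x == "-":
        return "gap_in_first"
    if y == "-":
        return "gap_in_second"
    return "match" if x == y else "mismatch"


def alignment_stats(first, second, match=10, mismatch=0, gap_open=50, gap_extend=3):
    """Return (identical columns, alignment length, score) for two gapped rows."""
    if len(first) != len(second):
        raise ValueError("aligned rows must have the same length")
    kinds = [column_kind(x, y) for x, y in zip(first.upper(), second.upper())]
    identical = kinds.count("match")
    score = 0
    # Consecutive columns of the same kind form a run; each gap run is penalised once.
    for kind, run in groupby(kinds):
        run_length = len(list(run))
        if kind == "match":
            score += match * run_length
        elif kind == "mismatch":
            score += mismatch * run_length
        else:
            score -= gap_open + gap_extend * run_length
    return identical, len(kinds), score


# Toy example: 7 identical columns, 1 mismatch, one gap of length 2.
identical, length, score = alignment_stats("ATGGA--TAG", "ATGCATTTAG")
print(identical, length, score, round(identical / length, 2))
```

With a match/mismatch matrix of this kind, "identical" and "similar" counts coincide, which is why both are reported as 785/1413 for the nucleotide alignment.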



dnaC staph homolog 
dnaC B subtilis



ATGGATAGAA XGTAXGAGCA AAATCAAAIG CCGCATAACA ATGAAGCTGA 
ATGACAGACC TTCTGAATGA CCGGCTTCCT CCGCAAAATA TAGAAGCCGA 
*** * * * * ***** ** * ***** *« 



dnaC staph homolog 
dnaC B subtilis 



ACAGTCTOTC TTAGGTTCAA TTATTATAGA TCCAGAATTG ATTAATACTA 
ACAAGCaSTG TTAGGCGCTA ITTTTTTACA GCCGTCTGCT TXAACACTGG 
^ ******* ** * * 



*** * ** ***** 



dnaC staph homolog 
dnaC B subtilis



CTCAGGAAGT TTTGCTTCCr GAGTCGTTTT ATAGGGGXGC CCATCAACAT 
CTTCAGAAGT ATTGATTCCA GATGATTXX:T ATAGAATOTC CCACCAAAAA 
** ***** *** **** ** ** * **** * ««* *** * 



dnaC staph homolog 
dnaC B subtilis



ATTTTCCGTG CAATGATGCA CTTAAATGAA GATAATAAAG AAATTGATGT 
ATCTATAATG CGATGCTGGT GCTCGGTGAC CGAGGTGAAC CGGTXGATCT 
** * ** * *** ** * *** * ** ***** * 



dnaC staph homolog 
dnaC B subtilis 

dnaC staph homolog 
dnaC B subtilis



TGTAACTiTTG ATGGATCAAT TATCGACGGA AGGTACGTTG AATGAAGCGG 
GGTGACAGTT ACATCAGAGC TTGCGAACAC AGACCTGCTG GAAGAAGTAG 
** *** * * * * *** * k **** * 

GTGGCCCGCA ATATCTTGCA GAGTTATCTA CAAATGTACC AACGACGCGA 
GCGGTATTTC ATATTTGACA GATATCGCAA ACTCGGTGCC. GACAGCGGCT 
*f ** **** ******** ** **■ ** ** 



dnaC staph homolog 
dnaC B subtilis 



AATOTTCAGT ATTATACTGA TATCGTXXCT AAQCATGCAT TAAAACGTAG 
AACATAGAAT ATXACGCGAA AATCGTTGAG GAAAAATCGA TTCXTCGCCG 
***** **** * * ****** * * * * ** ♦ 



dnaC staph homolog 
dnaC B subtilis 



ATXGATTCAA ACTGCAGATA GTATTGCCAA T6ATGGATAT AATGATGAAC 
ATTAATCAGA ACTGCGACAA CGATTGCXCA AGACGGGXAT ACCCGTGAGG 

k «**#* * ** ** *«* * *** 



*** ** * ***** 



dnaC staph homolog 
dnaC B subtilis 



TTGAACTAGA TGCGAXTTTA AGTGATGCAG AACGTOGAAT TTTAGAGCTA 
ATGAGGTCGA GGATTTACTC AGXGAAGCGG AAAAAACGAT XATGGAAGTG 
*** * ** * * * ***** ***** ******* 



dnaC staph homolog 
dnaC B subtilis 



TCATCXTCXC GTGAAAGCGA TGGCTXrAAA GACATTCGAG ACGTCTTAGG 
GCACAGCGCA AAAACACGAG TGCCTTCCAA AATATTAAGG ACGTCCTTGT 
** ***** ** **** ***[ir**** 



dnaC staph homolog 
dnaC B subtilis 



ACAAGTGXAT GAAACAGCTG AAGAGCTIGA TCAAAATAGT GGXCAAACAC 
CCAGACCTAT GATAATATCO AACAGCIXTA CAATCGAAAA GGXGATATCA 
** *** ** * * ** ***** * * * *** * * 



dnaC staph homolog 
dnaC B subtilis 



CAGGXATACC TACAGGATAT CGAGAXXTAG ACCAAATGAC AGCAGGQTTC 
CGGGAATXCC AACAGGGTXT ACGGAGCTTG ACCGGATGAC TGCGGGTTTC 
* ** ** ** ***** * * ** * * *** ***** ** ** *** 



dnaC staph homolog
dnaC B subtilis 



AACOGAAATG ATTTAAXTAT CCTTGCAGCtS CGXCCATCTG TAGGTAA6AC 
CAGCGCAACG ACTXGATCAT TGTGGCXGCC CGXCCTTCAG TAGGGAAAAC 
********* ** ** « ** ^* ***** * ** 



dnaC staph homolog
dnaC B subtilis



XGCGTTC6CA CTTAATATTG CACAAAAAGT TGCAACGCAT GAAGATAT6X 
AGCCTXXGCC CTGAACATCG CACAAAACGT GGCGACGAAG ACCGAT---G 
**f ** ** ** ** ** * *****i^* ** ** **» * 
dnaC staph homolog
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 

dnaC staph homolog 
dnaC B subtilis



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis 



dnaC staph homolog 
dnaC B subtilis






ATACAGTTGG TATXTXCTCG CTAGAGATGG GTGCT6ATCA GTTAGCCACA 
AGAGC6TAGC 6ATTTTCAG7 CTTGAGATGG GTGCOGAGCA GCTCGTTATG 
* * * ****** ** ****^** **** ******* * 

CGTATGATTT GTAGTTCTGG AAATGTX6AC TCAAACCGCT TAAGAACGGG 
CGTATGCTCT GTGCC6AGGG AAATATCAAT GCCCAGAATC TCCGTACAGG 
****** * * ** ** **** * * * * ****** 

TACTATGACX GAGGAAGATT GGAGTCGTTT TACTATAGCG GTAGGTAAAT 
TAACCTGACC GAAGAGGATt GGGGCAAGCT GACGATGGCA ATGGGAAGCC 
** **** ** ** **** ** * * ** ** ** * ** * 

TATCACGTAC GAAGATTTTT ATTGATGATA CACCGGGTAT TOSAATTAAT 
TArCGAACAG CGGGAXrXAC ATCGATGATA CACCGGGTAT TCGAGTGA6T 
**** * ***** ** ******* ********** **** * * * 

GATTrACGTX CXAAATGTCG TCGATXAAAG CAAGAACATG GCTTAGACAT 
GAAATCCGX6 CCAAGXGCCG CCGCXXGAAG CAGGAAAGCG GGCXGGGCAT 
** * *** ★ ** ** ** ** ** *** ** *** * * * * *** 

QATTGXGAXX GACXACXTAC AGTrGATTCA AGGTAGTGGX TCACGTGCGT 
GATTTT6ATC GATXACCTGC AATTGATTCA GGGAAGCGGT ---CGTTCAA 
**** **** ** *** * * * ******** ** ** *** *** * 

CCGATAACAG ACAACAGGAA GXTrCTGAAA TCTCTCGTAC ATTAAAAGCA 
AGGACAACCG TCAGCAGGAA GTATCTGAAA TTTCCCGTGA ACTGAAGTCG 
** *** * ** ****** ** ******* * ** *** * * ** * 

XXAGCCCGTG AATTAGAATG TCCAGTXATC GCATTAAGTC AGTTATCTCG 
AXXGCGAGGG AGCTGCAAGT CCCrCTTATC GCGCTTTCTC AGCTTTCCAG 
********* ** ****** ** * ******* ft 

X6GXGXXGAA CAACGACAAG AXAAACGXCC AAT6ATGAGT GATATTCGTG 
GGGTGTXGAG CAGCGTCAGG AXAAACGXCC GATGATGTCT GATATCCGGQ 
******** ** ** ** * ********** ****** * ***** ** * 

AATCXGGTTC GATXGA6CAA GATGCCGAXA TCGTTGCATT CTTATACCGT 
AAXCAGGAAG TATCGAGCAG GACGCGGATA TTGTCGCGTT CCTTTATCGT 
**** ** ** ***** ** ** **** ********* #« »** 

GATGATXACX AXAACCGTGG CGGCGATGAA GATGATGACG ATGATGGTGG 

GATGACTACT ATGAC 

***** **** ** ** 

TTTCGAGCCA CAAACGAATG ATGAAAACGG TGAAATXGAA ATTATCATTG 

AAAGAAACCX3 AGAATAAAAA XATTATCGAA ATTATXATCG 

** ****** « ** *** ***** ** * 

CXAAGCAACG TAACGGTCCA ACAGGCACAG XXAAGTTACA TTTTATGAAA 
CCAAACAGCG TAACGGCCCG GTAGGAACCG TGTCTCTTGC GTTCGTAAAA 
* ** ** ** ******* ** *** ** * * * ** * *** 

CAATATAATA AATTTACCGA --TATCGATT AXGCACAXGC 

GAATACAACA AATTCGTCAA CCXGGAACGG CGTTXTGATG ACQCAGGCGX 
**** ** * **** * * * * * *** * 

AGATATGATQ TAA 
TCOGCCOGGC GCA 
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SEQ ID NO: 8 



DnaC B. subtilis

1 MTDLLNDRLP PQNIEAEQAV LGAIFLQPSA LTLASEVLIP DDFYRMSHQK

51 IYNAMLVLGD RGEPVDLVTV TSELANTDLL EEVGGISYLT DIANSVPTAA

101 NIEYYAKIVE EKSILRRLIR TATTIAQDGY TREDEVEDLL SEAEKTIMEV

151 AQRKNTSAFQ NIKDVLVQTY DNIEQLYNRK GDITGIPTGP TELDRMTAGF

201 QRNDLIIVAA RPSVGKTAPA LNIAQNVATK TDESVAIFSL EMGAEQLVMR

251 MLCAEGNINA QNLRTGNLTE EDWGKLTMAM GSLSNSGIYI DDTPGIRVSE

301 IRAKCRRLKQ ESGLGMILID YLQLIQGSGR SKDNRQQEVS EISRELKSIA

351 RELQVPVIAL SQLSRGVEQR QDKRPMMSDI RESGSIEQDA DIVAFLYRDD

401 YYDKETENKN IIEIIIAKQR NGPVGTVSLA FVKEYNKFVN LERRFDDAGV

451 PPGA



SEQ ID NO: 9 DnaC S. aureus

1 MDRMYEQNQM PHNNEAEQSV LGSIIIDPEL IWTTQEVLLP ESFYRGAHQH

51 IFRAMMHLNE DNKEIDVVTL MDQLSTEGTL NEAGGPQYLA ELSTNVPTTR

101 NVQYYTDIVS KHALKRRLIQ TADSIANDGY NDELELDAIL SDAERRILEL

151 SSSRESDGFK DIRDVLGQVY ETAEELDQNS GQTPGIPTGY RDLDQMTAGF

201 NRNDLIILAA RPSVGKTAPA LNIAQKVATH EDMYTVGIPS LEMGADQLAT

251 RMICSSGNVD SNRLRTGTMT EEDWSRFTIA VGKLSRTKIF IDDTPGIRIN

301 DLRSKCRRLK QEHGLDMIVI DYLQLIQGSG SRASDNRQQE VSEISRTLKA

351 LARELECPVI ALSQLSRGVE QRQDKRPMMS DIRESGSIEQ DADIVAFLYR

401 DDYYNRGGDE DDDDDGGFEP QTNDENGEIE IIIAKQRNGP TGTVKLHFMK

451 QYNKFTDIDY AHADMM



dnaC amino acid sequences alignment 

Identical = 265/471 (0.56), similar = 345/471 (0.73)

Substitution matrix: blosum62

Gap penalty: - (XX + X * (gap length)) 
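
The summary figures above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the alignment program used for the figure: the function names are hypothetical, and the gap-open and gap-extend values passed to `affine_gap_penalty` are placeholders chosen only for illustration, since the constants in the gap penalty line are not assumed here. It shows how the identical and similar fractions follow from the counts, how a penalty of the form -(open + extend * gap length) is evaluated, and how identical positions are counted in one pair of aligned 10-residue blocks taken from the first alignment row.

```python
def fraction(matches: int, aligned_length: int) -> float:
    """Fraction of aligned positions that match, rounded to two decimals."""
    return round(matches / aligned_length, 2)


def affine_gap_penalty(gap_length: int, gap_open: float, gap_extend: float) -> float:
    """Penalty of the form -(gap_open + gap_extend * gap_length).

    gap_open and gap_extend are hypothetical parameters; the figure's
    own constants are not reproduced here.
    """
    return -(gap_open + gap_extend * gap_length)


def count_identical(row_a: str, row_b: str) -> int:
    """Count positions where two aligned rows carry the same residue (gaps excluded)."""
    return sum(1 for a, b in zip(row_a, row_b) if a == b and a != "-")


if __name__ == "__main__":
    print(fraction(265, 471))  # identical fraction
    print(fraction(345, 471))  # similar fraction
    # Fifth 10-residue block of the first alignment row (S. aureus vs. B. subtilis)
    print(count_identical("ESFYRGAHQH", "DDFYRMSHQK"))
    # Placeholder constants, for illustration only
    print(affine_gap_penalty(gap_length=3, gap_open=10, gap_extend=1))
```

With the counts stated in the figure, the two fractions round to 0.56 and 0.73, matching the values printed in the alignment header.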

dnaC Staph homolog MDRMYEQNQM PHNMEAEQSV LGSiriDPEr. XWTTQEVLIiP ESFYRGAHQH 

dnaC B subtilis MTDLLNDRLP PQNIEAEQAV LGAIFLQPSA IiTLASEVIiIP DDFYRMSHQK 

* + * * w***+w ^ * ^ ***^.# ^ *** 4.** 

IPRAMMHLME DWKElDWTIi MDQtiSTEGTti NEAGGPQYLA EIiSTNVPTTR 
IYNAMLVIX3D RGEPVDIiVTV TSErANTDLIi EEVQGISYIiT DIANSVPTAA 
*#+ * 4. 4. 4.*4.**4. 4.*^ * It ** 4.4.4. 4.#*-* 

NVQYYTDIVS KHALRRRLIQ TADSIANDGY JTOELEXiDArL SDAERRILEL 
NIEYYAKIVE EKSXLRRLIR TATXIAQDGY TREDEVEDLIi SEAEKTIMEV 
*4,4.** ** 4. 4,^ ****4. .J.** « i,^^ * + 

SSSRESDGFK DIRDVLGQVY ETAEELDQNS GQTPGIPTGY RDLDQMTAGF 
AQRKNTSAFQ NIKDVLVQTY DNIEQLYMRK GDITGIPTGP TELDRMTAGF 
+ + + *+ 4'*+*** * * 4. *4'* ♦ *****4. 4.**'4.***** 

WRNDLIILAA RPSVGKTAPA LNIAQKVATH EDMYTVGIFS LEMGADQLAT 
QRNDLIIVAA RPSVGKTAFA LNIAQNVATK TD-BSVAIFS LEMGAEQLVM 

RMICSSGNVD SNRLRTOTHT EEDWSRFTIA VGKLSRTKIP IDDTPGIRIN 
RMLCAEGNIN AQNLRTGNLT EEDWGKLTMA MGSLSMSGIY IDDTPGIRVS 
**+*4. **4.+ 4- **** +* **♦* 4- *+* +* 4. *4. *******♦+•♦• 






DIjRSKCRRLK QBHGIiDMIVI DYLQLIQGSO SRASDNRQQE VSEISRTLKA 
EIRAKCRRLK QESGLGMItil DYLQLIQGSG -RSKDHRQQE VSEISREbKS 
♦* *# ********** ****** ****** **+ 

XARELBCPVI ALSQLSRGVE QRQOKRPMMS DIRESGSIEQ OADIVAFLYR 
lARELQVPVI AIiSQIiSRGVE QRQDKRPMMS DIRESGSIBQ DADIVAFLYR 
^****^ ♦** ********** ********** ********** ********** 

DDYYNRGGDE DDDDDGGFEp QTWmGElt IIIAKQRNGP TGTVKLHFMK 

DDVYDK ET^JKM-IIE IIIAKQRNGP VGTySLAFVK 

****^+ +* ++♦ ** ********** *** * *+* 

QVNKFTDIDy AHADM M 

EYNKFVNLER RPDDAGVPPG A 
+*»** +++ * 
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Bacterial Growth kinetics 



y-axis tick: 1000
x-axis: Time (min.): 0 40 80 120 160 200 240

Curves:
Non killer ORF, no induction
Non killer ORF, induction
77pTORF104, no induction
77pTORF104, induction
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ic,. 



GST                          GST/ORF104
Mr ACB 0 0.1 0.5 1.0 2.0     Mr ACB 0 0.1 0.5 1.0 2.0

Mr markers: 97, 66, 45, 30, 21, 15



WO 01/46383 PCT/US00/35180





Fig. 10

Lys Extract FP'S Extract 
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800 tooo tabo 1400 icoa ten 2oas 



ii) Tryptic peptide mass spectrum of interacting protein (1% SDS eluate)



Staphylococcus aureus genomic DNA
PCR with dnaI-specific primers
EcoRI   BamHI
Digestion with EcoRI and BamHI
EcoRI   BamHI   HindIII
Subcloning into pGADT7 vector

Phage 77 DNA
PCR with 77ORF104-specific primers
BamHI
Digestion with BamHI and PstI
HindIII
Subcloning into pGBKT7 vector
pGBK77ORF104 (7.5 kb): PADH1, GAL4 DNA-BD, 77ORF104

No transcription
Reporter genes: LacZ, HIS3 and ADE2

No transcription
Reporter genes: LacZ, HIS3 and ADE2




Reporter genes: LacZ, HIS3 and ADE2

1) pGBKT7-53 and pGADT7-T

2) pGBKT7-53 and pGAD dnaI

3) pGBK77ORF104 and pGADT7-T

4) pGBKT7-Lam and pCL1

5) pGBK77ORF104 and pGAD dnaI



E) 
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FIGURE 14A 



Endoproteinase Glu-C

ACB: 0 mg/ml   2.0 ng/ml   2.0 mg/ml
Lanes: L FT 1 2 FT 1 2 1

Mr markers: 97, 66, 45, 30

Labeled bands: V25, V24
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FIGURE 14B 



Chymotrypsin

0 mg/ml   2.0 ng/ml
Lanes: L FT 1 2 FT 1 2

Labeled bands: C21, C20, C16(-), C15(-), C14
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FIGURE 14C 



Amino acid residues corresponding to interacting partial proteolytic fragments.

Protease               Proteolytic fragment ID    ID of SEQ ID NO: 2 fragment interacting
                       (from Fig. 14A, B)         with 77ORF104 (from amino to carboxyl)

Endoproteinase Glu-C   V24                        117    313
                       V24                        119    313

Chymotrypsin           C38                        12     313
                       C25                        83     313
                       C24                        77     305
                       C23                        77     304
                       C22                        116    313
                       C21                        131    313

SEQ ID NO: 2           Complete                   1      313
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FIGURE 15 



SEQ ID NO: 16

>S. aureus dnaI: amino acid 150-313

AADDICTAITNGEQVKGLYLYGPFGTGKSFILGAIANQLKSKKVRSTIIYLPEFIRTLKG

GFKDGSFEKKLHRVREANILMLDDIGAEEVTPWVRDEVIGPLLHYRMVHELPTFFSSNFD

YSELEHHLAMTRDGEEKTKAARIIERVKSLSTPYFLSGENFRNN

SEQ ID NO: 17

>S. aureus dnaI: nucleotide 448-942

gcagcagatgatatttgtacagcaataactaatggggaacaagtgaaaggcctttacctt
tatggtccatttgggacaggtaaatcttttattctaggtgcaattgcgaatcagctcaaa
tctaagaaggtacgttcgacaattatttatttaccggaatttattagaacattaaaaggt
ggctttaaagatggttcttttgaaaagaaattacatcgcgtaagagaagcaaacatttta
atgcttgatgatattggggctgaagaagtgactccatgggtgagagatgaggtaattgga
cctttgctacattatcgaatggttcatgaattaccaacattctttagttctaattttgac
tatagtgaattggaacatcatttagcgatgactcgtgatggtgaagagaagactaaagca
gcacgtattattgaacgtgtcaaatctttgtcaacaccatactttttatcaggagaaaat
ttcagaaacaattga
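
The coordinates of SEQ ID NO: 16 (amino acids 150-313) and SEQ ID NO: 17 (nucleotides 448-942) can be related by simple codon arithmetic, and the nucleotide fragment can be translated to cross-check the amino acid listing. The sketch below is illustrative only: the function names are hypothetical, the standard genetic code is assumed, and the upper bound of 942 is taken to include the terminal stop codon (tga), which is how the listed fragment ends.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def residue_range_to_nucleotides(first: int, last: int, include_stop: bool = False):
    """Map a 1-based residue range to 1-based nucleotide coordinates in the ORF."""
    start = 3 * (first - 1) + 1
    end = 3 * last + (3 if include_stop else 0)
    return start, end


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(residue_range_to_nucleotides(150, 313))                     # coding bases only
    print(residue_range_to_nucleotides(150, 313, include_stop=True))  # with stop codon
    first_line = "gcagcagatgatatttgtacagcaataactaatggggaacaagtgaaaggcctttacctt"
    print(translate(first_line))
    print(translate("ttcagaaacaattga"))
```

Residue 150 begins at nucleotide 3 x 149 + 1 = 448, and residue 313 ends at nucleotide 939; the three bases of the stop codon bring the end to 942. Translating the first 60 nucleotides gives AADDICTAITNGEQVKGLYL, the first 20 residues of SEQ ID NO: 16, and the final 15 nucleotides give FRNN followed by the stop.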



SEQ ID NO: 18

>S. aureus dnaI: amino acid 64-313

YKDQQKHYDGHKFADCPNFVKGHVPELYVDWNRIKIRYLQCPCKrKYDEERFEAELITSHH

MQRDTimKLKDIYMNHRDRLDVAMAADDICTAITNGEQVKGLYLYGPFGTGKSFILGAI

ANQLKSKKVRSTIIYLPEFIRTLKGGFKDGSFEKKLHRVREANILMLDDIGAEEVTPWVR

DEVIGPLLHYRMVHELPTFFSSNFDYSELEHHLAMTRDGEEKTKAARIIERVKSLSTPYF

LSGENFRNN
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FIGURE 16 




B 



DnaI fragments and 77ORF104

1. pGADDnaI(150-313) and pGBKORF104
2. pGADDnaI(150-313) and pGBKLam
4. pGADDnaI(64-313) and pGBKLam
5. pGADDnaI and pGBKORF104
6. 77|}OA0OnF13aiFd pCSKtOI




TL minus SD medium 



THAL minus SD medium 



                  Residues    Interaction with 77ORF104

SEQ ID NO: 2      1-313       yes
SEQ ID NO: 18     64-313      yes
SEQ ID NO: 16     150-313     yes
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